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Summary. Several statistical techniques are proposed for economically ana- 
jyzing large masses of data by means of punched-card equipment; most of these 
techniques require only a counting sorter. The methods proposed are de- 
signed especially for situations where data are inexpensive compared to the 
cost of analysis by means of statistically ‘efficient” or ‘‘most powerful” pro- 
cedures. The principal technique is the use of functions of order statistics, 
which we call systematic statistics. 

It is demonstrated that certain order statistics are asymptotically jointly 
distributed according to the normal multivariate law. 

For large samples drawn from normally distributed variables we describe 
and give the efficiencies of rapid methods: 

i) for estimating the mean by using 1, 2, ---, 10 suitably chosen order 
statistics; (ef. p. 386) 

ii) for estimating the standard deviation by using 2, 4, or 8 suitably chosen 
order statistics; (cf. p. 389) 

iii) for estimating the correlation coefficient whether other parameters of the 
normal bivariate distribution are known or not (three sorting and three 
counting operations are involved) (cf. p. 394). 

The efficiencies of procedures ii) and iii) are compared with the efficiencies of 
other estimates which do not involve sums of squares or products. 


1. Introduction. The purpose of this paper is to contribute some results 
concerning the use of order statistics in the statistical analysis of large masses 
of data. The present results deal particularly with estimation when normally 
distributed variables are present. Solutions to all problems considered have 
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been especially designed for use with punched-card equipment although for 
most of the results a counting sorter is adequate. 

Until recently mathematical statisticians have spent a great deal of effort 
developing “efficient statistics” and ‘most powerful tests.’’? This concentration 
of effort has often led to neglect of questions of economy. Indeed some may 
have confused the meaning of technical statistical terms “efficient” and “ef- 
ficiency”? with the layman’s concept of their meaning. No matter how much 
energetic activity is put into analysis and computation, it seems reasonable to 
inquire whether the output of information is comparable in value to the input 
measured in dollars, man-hours, or otherwise. Alternatively we may inquire 
whether comparable results could have been obtained by smaller expenditures. 
In some fields where statistics is widely used, the collection of large masses of 
data is inexpensive compared to the cost of analysis. Often the value of the 
statistical information gleaned from the sample decreases rapidly as the time 
between collection of data and action on their interpretation increases. Under 
these conditions, it is important to have quick, inexpensive methods for analyzing 
data, because economy demands militate against the use of lengthy, costly 
(even if more precise) statistical methods. A good example of a practical 
alternative is given by the control chart method in the field of industrial quality 
control. The sample range rather than the sample standard deviation is used 
almost invariably in spite of its larger variance. One reason is that, after brief 
training, persons with slight arithmetical knowledge can compute the range 
quickly and accurately, while the more complicated formula for the sample 
standard deviation would create a permanent stumbling block. Largely as a 
result of simplifying and routinizing statistical methods, industry now handles 
large masses of data on production adequately and profitably. Although the 
sample standard deviation can give a statistically more efficient estimate of the 
population standard deviation, if collection of data is inexpensive compared to 
cost of analysis and users can compute a dozen ranges to one standard deviation, 
it is easy to see that economy lies with the less efficient statistic. 

It should not be thought that inefficient statistics are being recommended for 
all situations. There are many cases where observations are very expensive, 
and obtaining a few more would entail great delay. Examples of this situation 
arise in agricultural experiments, where it often takes a season to get a set. of 
observations, and where each observation is very expensive. In such cases the 
experimenters want to squeeze every drop of information out of their data. 
In these situations inefficient statistics would be uneconomical, and are not 
recommended. 

A situation that often arises is that data are acquired in the natural course of 
administration of an organization. These data are filed away until the accumula- 
tion becomes mountainous. From time to time questions arise which can be 
answered by reference to the accumulated information. How much of these data 
will be used in the construction of say, estimates of parameters, depends on the 
precision desired for the answer. It will however often be less expensive to 





“INEFFICIENT” STATISTICS 379 


get the desired precision by increasing the sample size by dipping deeper into 
the stock of data in the files, and using crude techniques of analysis, than to 
attain the required precision by restricting the sample size to the minimum 
necessary for use with ‘‘efficient”’ statistics. 

It will often happen in other fields such as educational testing that it is less 
expensive to gather enough data to make the analysis by crude methods suf- 
ficiently precise, than to use the minimum sample sizes required by more refined 
methods. In some cases, as a result of Lhe type of operation being carried out 
sample sizes are more than adequate for the purposes of estimation and testing 
significance. The experimenters have little interest in milking the last drop of 
information out of their data. Under these circumstances statistical workers 
would be glad to forsake the usual methods of analysis for rapid, inexpensive 
techniques that would offer adequate information, but for many problems such 
techniques are not available. 

In the present paper several such techniques will be developed. For the 
most part we shall consider statistical methods which are applicable to estimating 
parameters. In a later paper we intend to consider some useful “‘inefficient”’ 
tests of significance. 


2. Order statistics. If asample O, = 11,%2,°** , 2 Of size n is drawn from 


a continuous probability density function f(x), we may rearrange and renumber 
the observations within the sample so that 


(1) % <M << oo CB 


(the occurrence of equalities is not considered because continuity implies zero’ 
probability for such events). The z,’s are sometimes called order statistics. 
On occasion we write x(z) rather than x;. Throughout this paper the use of 
primes on subscripted x’s indicates that the observations are taken without 
regard to order, while unprimed subscripted x’s indicate that the observations 
are order statistics satisfying (1). Similarly x(n;) will represent the n:th order 
statistic, while x’(n;) would represent the n;th observation, if the observations 
were numbered in some random order. The notation here is essentially the 
opposite of usual usage, in which attention is called to the order statistics by 
the device of primes or the introduction of a new letter. The present reversal 
of usage seems justified by the viewpoint of the article—that in the problems 
under consideration the use of order statistics is the natural procedure. 

An example of a useful order statistic is the median; when n = 2m + 1 (m = 
0,1, ---+), 2m+41 is called the median and may be used to estimate the population 
median, i.e. u defined by 


[10 a = 34. 


In the case of symmetric distributions, the population mean coincides with u 
and 2m4.1 will be an unbiased estimate of it as well. When n = 2m (m = 1, 2, 
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-++), the median is often defined as }(am + 2m41). The median so defined is 
an unbiased estimate of the population median in the case of symmetric dis- 
tributions; however for most asymmetric distributions 3(%m + 2m41) will only 
be unbiased asymptotically, that is in the limit as ” increases without bound. 
For another definition of the sample median see Jackson [8, 1921]. When z is 
distributed according to the normal distribution 


I _ ,— (1/202) (2—a)2 
V/ Ino ' 
the variance of the median is well known to tend to 2o’/2n as n increases. 

It is doubtful whether we can accurately credit anyone with the introduction 
of the median. However for some of the results in the theory of order statistics 
it is easier to give credit. In this section we will restrict the discussion to the 
order statistics themselves, as opposed to the general class of statistics, such as 
the range (x, — 21), which are derived from order statistics. We shall call 
the general class of statistics which are derived from order statistics, and use 
the value ordering (1) in their construction, systematic statistics. 

The large sample distribution of extreme values (examples 2, , ®,~s41 for r, 8 
fixed and n — ~) has been considered by Tippett [17, 1925] in connection with 
the range of samples drawn from normal populations; by Fisher and Tippett 
[3, 1928] in an attempt to close the gap between the limiting form of the dis- 
tribution and results tabled by Tippett [17], by Gumbel [5, 1934] (and in many 
other papers, a large bibliography is available in [6, Gumbel 1939]), who dealt 
with the more general case r > 1, while the others mentioned considered the 
special case of r = 1; and by Smirnoff who considers the general case of 2, , 
in [15, 1935] and also [16] the limiting form of the joint distribution of 2, , 2, , 
for rand s fixed asn — «. 

In the present paper we shall not usually be concerned with the distribution 
of extreme values, but shall rather be considering the limiting form of the joint 





N(a, a, 0°) = 


distribution of x(m), r(m), --+, x(x), satisfying 
. Ni ‘ 
ConpDlITION 1. lim — = \;; ¢=1,2,---, k; 
no nv 
Ar < Ae tte SN. 


In other words the proportion of observations less than or equal to x(n,) tends 
to a fixed proportion which is bounded away from 0 and 1 as n increases. K. 
Pearson [13, 1920] supplies the information necessary to obtain the limiting 
distribution of x(m,), and limiting joint distribution of x(m), x(me). Smirnoff 
gives more rigorous derivations of the limiting form of the marginal distribution 
of the a2(n;) [15, 1935] and the limiting form of the joint distribution of x(n;) 
and x(n,;) [16] under rather general conditions. Kendall [10, 1943, pp. 211-14] 
gives a demonstration leading to the limiting form of the joint distribution. 
Since we will be concerned with statements about the asymptotic properties 
of the distributions of certain statistics, it may be useful to include a short dis- 
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cussion of their implications both practical and theoretical. If we have a 
statistic 6(O,,) based on a sample On: 21, %2, °** : a, drawn from a population 
with cumulative distribution function F(x) it often happens that the function 
(6 — 0)/o, = yn, Where o, is a function of n is such that 


> ae - —jz* 
(A) lim Py, < t) = Van [. € dx. 
When this condition (A) is satisfied we often say: 6 is asymptotically normally 
distributed with mean @ and variance o,. We will not be in error if we use the 
statement in italics provided we interpret it as synonymous with (A). How- 
ever there are some pitfalls which must be avoided. In the first place condition 
(A) may be true even if the distribution function of y, , or of 6, has no moments 
even of fractional orders for any n. Consequently we do not imply by the itali- 
cized statement that lim E[6(O,)] = 0, nor that lim {[E(@) — [E(é)~} = 


n—20 n—>o 
o,, for, as mentioned, these expressions need not exist for (A) to be true. In- 
deed we shall demonstrate that Condition (A) is satisfied for certain statistics 
even if their distribution functions are as momentless as the startling distribu- 
tions constructed by Brown and Tukey [1, 1946]. Of course it may be the case 
that all moments of the distribution of 6 exist and converge as n > ~& to the 
moments of a normal distribution with mean @ and variance o,. Since this 
implies (A), but not conversely, this is a stronger convergence condition than 
(A). (See for example J. H. Curtiss [2, 1942].) However the important im- 
plication of (A) is that for sufficiently large n each percentage point of the 
distribution of 6 will be as close as we please to the value which we would compute 
from a normal distribution with mean @ and variance o°, , independent. of whether 
the distribution of 6 has these moments or not. 

Similarly if we have several statistics 6, , 6, ---, 6,, each depending upon 
the sample O,, : tes ee ** a r,, we shall say that the 6; are asymptotically jointly 
normally distributed with means 6, , variances o,(n), and covariances pijoio;, When 

lim P(y: < ti, yo < te, +++, Ye < tk) 


no 


(B) ty to tk " 
wf to f° aay a, 


v--o — 00 — 00 


where y; = (6, — 6:)/o;, and Q” is the quadratic form associated with a set of 
k jointly normally distributed variables with variances unity and covariances 
pj, and K is a normalizing constant. Once again the statistics 6; may not 
have moments or product moments, the point that interests us is that the 
probability that the point with coordinates (6,, 62, ---, 6%) falls in a certain 
region in a k-dimensional space can be given as accurately as we please for 
sufficiently large samples by the righi side of (B). 

Since the practicing statistician is very often really interested in the prob- 
ability that a point will fall in a particular region, rather than in the variance 
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or standard deviation of the distribution itself, the concepts of asymptotic | 
normality given in (A) and (B) will usually not have unfortunate consequences, 
For example, the practicing statistician will usually be grateful that the sample 
size can be made sufficiently large that the probability of a statistic falling into 
a certain small interval can be made as near unity as he pleases, and will not 
usually be concerned with the fact that, say, the variance of the statistic may 
be unbounded. 

Of course, a very real question may arise: how large must n be so that the 
probability of a statistic falling within a particular interval can be sufficiently 
closely approximated by the asymptotic formulas? If in any particular case 
the sample size must be ridiculously large, asymptotic theory loses much of its 
practical value. However for statistics of the type we shall usually discuss, 
computation has indicated that in many cases the asymptotic theory holds 
very well for quite small samples. 

For the demonstration of the joint asymptotic normality of several order 
statistics we shall use the following two lemmas. 

Lemma 1. If a random variable 6(O,) is asymptotically normally distributed 
converging stochastically to 0, and has asymptotic variance o (n) — 0, where n | 


is the size of the sample O, : t1, %2, °**, Ln, drawn from the probability density 

function h(x), and g(6) is a single-valued function with a nonvanishing continuous 

derivative g'(6) in the neighborhood of 6 = 0, then g(6) is asymptotically normally 

distributed converging stochastically to g(@) with asymptotic variance o’,{g'(0)\. 
Proor. By the conditions of the lemma 


, 6-6 . £ 1s 

lim P <t)= 7 | * du. 
Now if to, = A@, A@ = 6 — 86, using the mean value theorem there is a 6, in 
the interval [@, 6], such that 


g(6) = g(0) + (6 — 4)g’(H), 





which implies 


tim P(?—* <4) = tim (MOO <1), g'(6:) ~ 0, 








noo On no ong (01) 
where 6 is a function of n. However lim g’(@:) = g’(@) so we may write 
46-0 
lim p(° er a ) = tim P (80 — 90 < ‘), g'(0) ¥ 0. 
n->0o on no Ong (@) 


where the form of the expression on the right is the one required to complete 
the proof of the lemma. 

Of course if we have several random variables 6; , 62, ---, 6%, we can prove 
by an almost identical argument that 

Lemma 2. If the random variables 6;(O,) are asymptotically jointly normally 


' 
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distributed converging stochastically to 6; , and have asymptotic variances 0; (n) > 0, 


and covariances p;jo;0 ; , where n is the size of the sample O, : ry , rs orn x, drawn 
from the probability density function h(x), and g:i(6;), i = 1, 2, ---, k, are single- 
valued functions with nonvanishing continuous derivatives g:(6:) in the neighbor- 
hood of 6; = 0; , then the g;(6;) are jointly asymptotically normally distributed with 
means gi(9:), varzances otlg:(0:))° and covariances pi joie i9:(9:)93(0 i). 

The following condition represents restrictions on the probability density 
function f(x) sufficient for the derivation of the limiting form of the joint dis- 
tribution of the x(n;) satisfying Condition 1. 

ConpITION 2. The probability density function f(x) is continuous, and does not 
vanish in the neighborhood of u; , where 


[ t@ a =», $= 1,2,---,&. 


If we recall the discussion of condition (B) above, the theorem of Pearson 
and Smirnoff may be stated: 

THEOREM 1. Jf a sample O,: 21, 2, ***, Xn is drawn from f(x) satisfying 
Condition 2, and if x(m), x(ne) satisfy Condition 1 as n — «, then x(n), x(ne) 
are asymptotically distributed according to the normal bivariate distribution with 
Means U1, Ue, 


[- f@ ar = x, 


and variances 


. AC _— Aj) 


(= Ss, C= 1,2, 
oF AUP 
and covariance 
p1290102 = dr(L_ — Xe) a r2) 
ee fas) f(a) * 


Theorem 1 has an obvious generalization which seems not to have been carried 
out in the literature. The generalization may be stated: 

THEOREM 2. If a sample O,: 11, %2, ++, tn ts drawn from f(x) satisfying 
Condition 2, and if x(m), x(ne), --+, x(x) satisfy Condition 1 as n > ~, then 
the x(n:), 7 = 1, 2, ---, k, are asymptotically distributed according to the nor- 
mal multivariate distribution, with means u; , 


[[1@ dx = \, 


and variances 
a vx (1 ore dx) 


0; nftuy ’ t= 1,2,---,k, 
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and covariances 


Ai(1 — Aj) . : 
pijoi0; = ——— > 1lsti<jsk. 
nf (ui) f(y) 
Proor. We shall carry out the demonstration for the uniform distribution 
| 1O<z2z <1, 
f(x) = 


\0, elsewhere, 
and then utilize the fact that by a suitable transformation of the uniform dis- 
tribution we may get any f(x) satisfying Condition 2. Of course for the par- 
ticular case of the uniform distribution all moments of the x(n;) exist and con- 
verge to those of the asymptotic theory. 
The joint probability density of the x(n;), satisfying Condition 1 and drawn 
from f(x), is given by 


n! 


(m—1)!(n— mm)! I] @ — a1 - 1)! 
t=2 


x(n) n,—1 1 n—ny, k xz(nj) “|ng—ng-)-1 
([ at) (/ diss) II | ats 5 
0 x(nx) i=2 z(nj—1) 


Performing the indicated integrations we get from the right of (2) 





gla(ni), x(n), «++, x(mx)] = 


(2) 


(3) Cx(n,)" I [x(n — a(n,a)I"*-"—f — a(n)", 


where C is the multinomial coefficient on the right of (2). It is well known 


‘ iad ‘ . Ni . ni . 
that for the uniform distribution E[x(n;)| = a , or asymptotically —, t= 
1, 2, ---,&. We make the transformation y; = (x(n _ *:) Vn, leading to 








y m1 Ya nm,—l & e - Ni-1 [ys — = 
” ( . Jz) II n + Vn 
sm _ we) 
n /n 


Using the usual technique of factoring out expressions like 


ou —_ —l 
nN — m-1\"" **" 
n ’ 


; ; . ni 
we rewrite (4) with C2 as a new constant, and setting \; = — 
n 


Yi ny—1 
C2 ¢ + in) 


(4) 


(5) 





_&e — ge) \*O* — Yk _ 
“Hl (1 *é, = ova) € a= we 


eS N SEES 
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; . : 1 j 
Now taking the logarithm of (5), expanding, neglecting terms o(z) and higher, 


collecting terms and taking the antilogarithm we get the approximate asymp- 
totic distribution of the order statistics 


g(x(m1), (me), +++, x(mx)) 

(6) | - 42 y’ _ Ay Yiyi Sete} 

—_ C e Ni+ t—1 — tgi-— | 

: ” dv * Agua — Na) (Ax i Ai-1) : Ai oe} Ai-1 
where Ay = 0, Ax+1 = 1. Now setting up the matrix of the coefficients of the 
quadratic expression in the exponent 
Negi — Ai-1 1 
Aii 2 ee areas A;i- = A, ,eEeeCrre—,, 

(Aiea — Az)(AG — Asa) * : ’ Ne — Nees 
i=1,2,---,k; Ai; =0,|¢-—j|> 1. To obtain the variances and covar- 
jances we need 
cofactor of Aj; in || A; || 

determinant Aj; 
(see for example Wilks [18, p. 63 et seq.]). Now 
k+1 1 
(7) |A| = determinant A;; = [] ———; 
1 Nim N-1 
cofactor of Ay = A.(1 — Ax) | Al, 7 = 1, 2, ---, k. 
A(L—A)|Al, t< J 


A(L—-—A)|Al G<t 


Ad = 


ecfactor of A;; = 


This completes the proof for the uniform distribution. 

If the uniform distribution is transformed into a probability density function 
f(x) satisfying Condition 2, by an order preserving transformation, we appeal 
to Lemma 2. We notice that the x(n;) are transformed into g[x(n;)], and that 
the probability that x(n;) falls in the interval [u,; , u; + Au] is transformed into 
the probability that g[x(n;)] falls in the interval [g(ui), g(u; + Au;)]. Using 
the mean value theorem we may write 


g(ui + Au:) = g(ui) + Auig’(ui), 


where w; lies in the interval [u;, w; + Au]. However 
lim g’ (u;) = g' (ui). 
Au;-0 


The density for the uniform distribution in the interval [u;, u; + Au] is just 
Au;, and this same density will tend to f(u;)Awig’(ui). Therefore g’(u;) = 
1/f(u;), which completes the proof of Theorem 2. 

It would often be useful to know the small sample distribution of the order 
statistics, particularly in the case where the sample is drawn from a normal. 
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Fisher and Yates’ tables [4] give the expected values of the order statistics up 
to samples of size 50. However it would be very useful in the development 
of certain small sample statistics to have further information. It is perhaps too 
much to expect tabulated distribution functions, but at least the variances 
and covariances would be useful. A joint effort has resulted in the calculation 
for samples n = 2, 3, ---, 10 of the expected values to five decimal places, 
the variances to four decimal places, and the covariances to nearly two decimal 
places. It is expected that these tables will be published shortly. 


3. Estimates of the mean of a normal distribution. It will be important 
in what follows to define efficiency and to indicate its interpretation. Then we 
shall construct some estimates of the means of certain distributions and compute 
their efficiencies. Except for the tables given, the discussion is applicable to 
the estimation of the mean of any symmetric distribution; and, of course, the 
concept of efficiency is still more general in its application. A statistic 6(0,), 
where O, is the sample, is said to be an efficient estimate of 6 if 


i) Wn (6 — 6) is asymptotically normally distributed with zero mean and 
finite variance, o (6), and 

ii) for any other statistic 6’ with ~/n(6’ — 6) asymptotically normally dis- 
tributed with zero mean and variance o°(6’), o°(6) < o°(6’). 


The ratio o'(6)/o°(6’) is termed the efficiency of 6’ if 6 is an efficient estimate 
of 6. For discussion see Wilks [18, 1943]. The concepts of efficient statistic 
or estimate and of efficiency were introduced by R. A. Fisher. They serve as 
one measure of the amount of information a statistic draws from a sample. 
It is also common practice to speak of relative efficiencies, for example, of the 
statistics 6’ and 6” described in ii) above, we say if o°(6’) < o°(6’’) that the 
efficiency of 6’ relative to 6’ is the ratio of the smaller variance to the larger. 
This concept of efficiency has sometimes been used when the normality assump- 
tion has been violated by one or both statistics, when one or both are biased, 
and when small samples are considered. When used under these conditions 
the concept of efficiency becomes more difficult to interpret, although a compari- 
son of the variation of two statistics about the value they are commonly esti- 
mating is often of value. 

In the case of estimates of the mean a of a variable which is normally dis- 
tributed according to N(x, a, 0°) from a sample of n, we can often express the 
variance of an asymptotically unbiased estimate as o°(6;) = kio’/n. The sample 
mean @ = Yz;,/n is an efficient estimate of a with variance o’/n. Then in such 
cases the efficiency of 6; in estimating a is 1/k;. The interpretation is merely 
that to obtain the same precision using 6; as is possible with 6, one must use 
a sample k; times as large. 

Bearing in mind that we are at present searching for economical methods 
for analyzing large samples, it is clear that the concept of efficiency offers us a 
practical way of comparing cost of information with cost of obtaining it. 
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In the present section and in sections 4 and 5 we shall develop certain sys- 
tematic estimates of parameters of normally distributed variables. Our pro- 
cedure then will be to compare the efficiency of the systematic estimates with 
the efficient statistic for estimating the parameter in question, and also in sec- 
tions 4 and 5 we compare our estimates with a statistic not involving squares 
or products. Of course the efficient statistic for estimating the mean of a normal 
is the sample mean, therefore in this section we will only compare our estimates 
with the sample mean. 

We can construct unbiased estimates of the mean of a normal distribution 
from linear combinations of suitably chosen order statistics. These systematic 
statistics will be asymptotically normally distributed if the order statistics 
from which they are derived satisfy Condition 1. We will restrict ourselves to 
a useful practical case where equal weights are used. In other words the esti- 
mate discussed is just the average of k order statistics kK ’Sa(n;). Suppose 
a(n;), i = 1, 2, ---, k satisfy Condition 1, that E[x(n;)] = Ela(nz-i+:)], so that 
E{=x(n;)] = a. An important unsolved question is to discover what spacing 
of the x(n;) will yield minimum variance, and thereafter at what rate does the 
efficiency of this optimumly spaced estimate increase with k. Computational 
methods bog down rapidly after k = 3. Because so little is known about this 
problem it seems worthwhile to offer some results for three arbitrary spacings 
(these results are of course useful in analyzing data). 

If the x(n;) satisfy Theorem 2 we may approximate the variance of the sys- 
tematic statistic 6, = Za(n:)/k by the usual formula 


(8) o°(6.) = ElZx(n,)/k) — (E(2a(ns)/k)P. 


We lose no generality by assuming the mean and variance of the underlying 
normal to be 0 and 1 respectively. Then using the fact that Du; = 0, and 
the result of Theorem 1 we rewrite (8) as 


oe A(1 — Ax) As(1 — Aj) 
@) ob) = Bizetm) — wet = 2 | MOM pa MEM), 
where fm = f(um). 

Using the symmetry which makes \; = 1 — Ax-i4n, fi = Se-izn, and the fact 
that for k = 2r + 1, fra = 1/~/2z, Ar41 = 3, we may simplify the right side 
of equation (9) with the following results for k = 1, 2, ---,7. The factor 1/k’ 
has not been disturbed. We also write the general formulas for the simplified 
form of (9), but we omit a rather lengthy combinatorial argument which es- 
tablishes the generalization. 
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In addition to the possibility of minimizing the equations of (10) by numerical 
methods, three other procedures suggest themselves: i) to space the order 
statistics uniformly in probability; ii) to choose those k order statistics whose 
expected values are equal to the expected values of the order statistics in a 
sample of size k drawn from a unit normal; iii) to choose 4; = (¢ — 3)/k. The 
following table lists for k = 1, 2, and 3 the expected values wu; of the order sta- 
tistics and the probability to the left of the expected values \; for each of the 
procedures. The chosen order statistics are counted from left to right. It 
will be noticed that the third method gives very good results, and has the value 
of simplicity of formula. The following table gives a comparison between the 
efficiencies resulting from spacing by the three methods. The three optimum 
cases are included for completeness. 

Statisticians planning to use the method of expected values suggested above | 
will find Fisher and Yates [4, 1943] table of the expected values of the order 
statistics in samples of size k drawn from a unit normal helpful for computing 
the A;. Alternatively the following table of \; might be used. 

As an example of the use of Table III, suppose we are using the expected 
value method for estimating the mean of a large sample drawn from a normal 
distribution N(z, a, 0°). If we are willing to use 6 observations out of 1000 for 
this purpose Table III indicates the selection of 2103 , 2261 , Xa21 , Xss0, L740 , Xess: 
Furthermore Table II indicates that the variance of the estimate of a based 
on the average of these six observations will be approximately o”/.948n, n = 1000. 
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4. Estimates of the standard deviation. The statistic 
& =D (xi — #)*/(n — 1), 
i=1 


TABLE I 


Comparison of the order statistics which would be chosen according to each of the 
four procedures for subsamples of k = 1, 2, 3 
























































. Equal 
‘ Order Optimum Probability Expected Values | i= (t—4)/k 
GS — — | —___________ | _— 

Ui ri Uj | Ni Uy ri Ui | XN 
l First .0000} .5000! .0000) .5000) .0000) .5000} .0000).5000 
2 | First | —.6121| .2702|—.4307| .3333| — .5642) .2863) — .6745).2500 
| Second | .6121|) .7298) .4307| .6667| .5642 7137) .6745) . 7500 
3 | First = .9056| .1826| — .6745| .2500| — .8463) .1967| — .9674| . 1667 
Second -0000| :5000; .0000) .5000; .0000) .5000) .0000/.5000 
Third | .9056) .8174| .6745|) .7500) .8463) . 8033} : 9674) . 8333 





TABLE II 


Comparison of the efficiencies of four methods of spacing k order statistics used 
in the construction of an estimate of the mean 











k rmi/(k+i1) | Rapeetes = (i—¥)/k | Optimum 
I .637 | 637 | .637 .637 
2 .793 | .809 | .808 .810 
3 | .860 | .878 .878 .879 
4 | .896 | 914 | .913 

5 | .918 | .933 | .934 

e* | .933 | .948 | .948 

7 | .944 | .956 .957 

8 | .952 | 963 | .963 

9 | .957 | .968 | .969 | 
10 | .962 | .972 .973 











* The u; are chosen equal to the expected values of the order statistics of a sample of 
size k. 


where @ = ou az;/n is well known to be an unbiased estimate of the popula- 
tion variance o’, for n > 1. However s is not in general an unbiased estimate 
of oc. We are not interested here in the question of when we should estimate o 
and when it is more advantageous to estimate o. All we want is to have an 
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unbiased estimate of o, based on sums of squares, to compare with another 
unbiased estimate based on order statistics. In the case of observations drawn 
from a normal distribution 


, — Gn)'TQln — 1) fae 
” Sh 


is an unbiased estimate of o (see for example Kenney [11], with variance 


(12) o(s’) = (3 | Pate =] (n—1)— ihe 


TABLE III* 


P(x < ui) X 104, uijx = Erie), Lijx ts the ith order statistic in a sample of 
size k drawn from a normal distribution N (x, 0, 1) 


1987 | 5000 

1516 | 3832 | 6168 
1224 | 3103 | 5000 | 6897 | 8776 

1025 | 2605 | 4201 | 5799 | 7395 | 8975 

0881 | 2244 | 3622 | 5000 | 6378 | 7756 | 9119 

0773 | 1971 | 3182 | 4394 | 5606 | 6818 | 8030 | 9227 

0688 | 1756 | 2837 | 3919 | 5000 | 6082 | 7163 | 8244 | 9312 

0619 | 1584 | 2559 | 3536 | 4512 | 5488 | 6464 | 7441 | 7416 | 9381 


1 
2 
3 
+ 
5 
6 
7 
8 
9 
0 


1 





* The table is given to more places than necessary for the purpose suggested because it 
may be of interest in other applications. The E(z;|.) from which the table was derived 
were computed to five decimal places. 


For most practical purposes however, when n > 10, the bias in s is negligible. 
For large samples o°(s’) approaches o”/2n. 


4A. The range as an estimate of oc. As mentioned in the Introduction, 
section 1, it is now common practice in industry to estimate the standard devia- 
tion by means of a multiple of the range R’ = c,(z, — 21), for small samples, 
where c, = 1/[E(yn) — E(y:)], yn and y; being the greatest and least observations 
drawn from a sample of size n from a normal distribution N(y, a, 1). Although 
we are principally interested in large sample statistics, for the sake of complete- 
ness, we shall include a few remarks about the use of the range in small samples. 

Now R’ is an unbiased estimate of o, and its variance may be computed for 
small samples, see for example Hartley [7, 1942]. In the present case, although 
both R’ and s’ are unbiased estimates of o, they are not normally distributed, 
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nor are we considering their asymptotic properties; therefore the previously 
defined concept of efficiency does not apply. We may however use the ratio 
of the variances as an arbitrary measure of the relative precision of the two 
statistics. The following table lists the ratio of the variances of the two sta- 
tistics, as well as the variances themselves expressed as a multiple of the popu- 
lation variance for samples of size n = 2,3, ---, 10. 


4B. Quasi ranges for estimating o. The fact that the ratio o°(s’)/o (R’) 
falls off in Table IV as n increases makes it reasonable to inquire whether it 
might not be worthwhile to change the systematic estimate slightly by using 
the statistic ¢i,[%,1 — x2], or more generally ¢,\n[%a-, — 2,41] where c,), is the 
multiplicative constant which makes the expression an unbiased estimate of o 
(in particular c,), is the constant to be used when we count in r + 1 observations 
from each end of a sample of size n, thus ¢rjn = 1/[E(yn-- — yri1)] where the 


TABLE IV 


Relative precision of s’ and R’, and their variances expressed as a multiple of o?, 
P P P 
the population variance 


o*(8’)/o*(R’) o*(s')/o? o*(R’)/o? 


1.000 570 570 
.990 273 276 
977 178 182 
.962 132 137 
932 104 112 
910 .0864 .0949 
.889 .0738 .0830 
9 869 .0643 .0740 

10 851 | .0570 .0670 





y’s are drawn from N(y, a, 1)). This is certainly the case for large values of n, 
but with the aid of the unpublished tables mentioned at the close of section 2, 
we can say that it seems not to be advantageous to use ¢1;n[%,-1 — x2] for n < 10. 
Indeed the variance ¢i,10[%) — 22], for the unit normal seems to be about .10, 
as compared with o(R’)/o° = .067 as given by Table IV, for n = 10. The 
uncertainty in the above statements is due to a question of significant figures. 

Considerations which suggest constructing a statistic based on the difference 
of two order statistics which are not extreme values in small samples, weigh 
even more heavily in large samples. A reasonable estimate of o for normal 
distributions, which could be calculated rapidly by means of punched-card 
equipment is 


(13) $= : [x(n) — 2(m)], 
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where the x(n;) satisfy Condition 1, and where c = w— uw, uw and wy are the 
expected values of the nz and m order statistics of a sample of size n drawn from 
a unit normal. Without loss of generality we shall assume the x; are drawn 


3. @ 


‘ ‘ Ne 
from a unit normal. Furthermore we let — = AX» = 1 — \; = 1 
n n 


course o will be asymptotically normally distributed, with variance 


(14) (6) = = | Me Dg SE) 1 = me 

ne? [f(m)? [f(ue)? S(ur)f(uz) 
Because of symmetry f(w) = f(w); using this and the fact that A: = 1 — do, 
we can reduce (14) to 
2 Ax(1 — 2d1) 
ne [f(m)} © 
We are interested in optimum spacing in the minimum variance sense. ‘The 
minimum for o (6) occurs when \; = .0694, and for that value of \1, 0° (¢) = 
.767 o°/n. Asymptotically s’ is also normally distributed, with o”(s’) = o°/2n. 
Therefore we may speak of the efficiency of ¢ as an estimate of o as .652. It is 
useful to know that the graph of o’(é) is very flat in the neighborhood of the 
minimum, and therefore varying \; by .01 or .02 will make little difference in 
the efficiency of the estimate ¢ (providing of course that c is appropriately 
adjusted). K. Pearson [13] suggested this estimate in 1920. It is amazing that 
with punched-card equipment available it is practically never used when the 
appropriate conditions described in the Introduction are present. 

The occasionally used semi-interquartile range, defined by \,; = .25 has an 
efficiency of only .37 and an efficiency relative to ¢ of only .56. 

As in the case of the estimate of the mean by systematic statistics, it is per- 
tinent to inquire what advantage may be gained by using more order statistics 
in the construction of the estimate of o. If we construct an estimate based on 
four order statistics, and then minimize the variance, it is clear that the extreme 
pair of observations will be pushed still further out into the tails of the dis- 
tribution. This is unsatisfactory from two points of view in practice: i) we will 
not actually have an infinite number of observations, therefore the approxima- 
tion concerning the normality of the order statistics may not be adequate if A, 
is too small, even in the presence of truly normal data; ii) the distribution 
functions met in practice often do not satisfy the required assumption of norm- 
ality, although over the central portion of the function containing most of the 
probability, say except for the 5% in each tail normality may be a good approxi- 
mation. In view of these two points it seems preferable to change the question 
slightly and ask what advantage will accrue from holding two observations at 
the optimum values just discussed (say 4, = .07, Xx = .93) and introducing 
two additional observations more centrally located. 

We define a new statistic 


(15) o(é) = 


(16) a? = 5 [e(m) + 2m) — 2(m) — 2(m)] 
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ce’ = Efx(ns) + x(n3) — x(me) — x(m)], where the observations are drawn from 
aunit normal. We take \; = 1 — \4,A2 = 1 — As, A, = 07. It turns out that 
o (é’) is minimized for 2 in the neighborhood of .20, and that the efficiency com- 
pared with s’ is a little more than .75. Thus an increase of two observations in 
the construction of our estimate of o increases the efficiency from .65 to .75. 
We get practically the same result for .16 < A, < .22. 

Furthermore, it turns out that using \; = .02, +» = .08, As = .15, Ay = .25, 
hs = -75, Ae = .85, Ay = .92, As = .98, one can get an estimate of o based on 
eight order statistics which has an efficiency of .896. This estimate is more 
efficient than either the mean deviation about the mean or median for esti- 
mating o. The estimate is of course 


a” = [x(ns) + x(nz) + x(m6) + x(ms) — 2(m4) — 2(ns) — x(n) — x(m)]/C, 
where C = 10.34. 


To summarize: in estimating the standard deviation o of a normal distribution 
from a large sample of size n, an unbiased estimate of o is 


A 


1 
= C (Sa~041 ee Le), 
where c = E(Yn-r41 — Yr) where the y’s are drawn from N(y, a, 1). The estimate 
¢ is asymptotically normally distributed with variance 
2 Au(1 — 2A) 
ne? [f(w)P ’ 


where \. = r/n, f(m) = N(E(z,), 0, o°). We minimize o°(é) for large samples 
when \, = .0694, and for that value of  , 


7670" 


oopt (3) = _ 


o(é) = 





The unbiased estimate of o 


Al 


1 
"> (Sara + Le-~o41 — Ze — 2) 


may be used in lieu of ¢. If iy = r/n, 2 = 8/n we find 





- 660° 
os 


o (6 | A — .07, de = .20) 


4C. The mean deviations about the mean and median. The next level of 
computational difficulty we might consider for the construction of an estimate 
of « is the process of addition. The mean deviation about the mean is a well 
known, but not often used statistic. It is defined by 


(17) md. = >>| a2; — £|/n. 
i=l 
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For large samples from a normal distribution the expected value of m.d. is 


V2 a, therefore to obtain an unbiased estimate of o we define the new statistic 


A= 5 md. Now for large samples A has variance o [}(4 — 2)]/n, or an 


efficiency of .884. However there are slight awkwardnesses in the computation 
of A which the mean deviation about the median does not have. 

It turns out that for samples of size n = 2m + 1 drawn from a normal dis- 
tribution N(y, a, 0) the statistic 


Pc © 2, | Li — tm | 
(18) 


asymptotically has mean o and variance 
2 os l oo 2 2 
(19) o (M’) = et 9 )e ‘ 

Thus in estimating the standard deviation of a normal distribution from 
large samples we can get an efficiency of .65 by the judicious selection of two 
observations from the sample, an efficiency of .75 by using four observations, 
and an efficiency of .88 by using the mean deviation of all the observations from 


either the mean or the median of the sample, and an efficiency of .90 by using 
eight order statistics. 





5. Estimation of the correlation coefficient. In the present section we con- 
sider the estimation of the correlation coefficient of a normal bivariate population: 








1 
x, es 
(20) f(z, 9) QrozoyW/1 — p? 
wp) — be (= 4 YH BF _ Bale = ally ~ ») 
P 21 — p) - oy Oz Sy " 
The efficient estimate of p in a sample O, : (x1, yi), (25 Y2)) °° *> (ny Yn) drawn 
from the density (20) is 
(21) , - D(z; — Zz yi Y) 


LG: — 2° La — NT 
There are numerous other techniques in the literature for estimating p, among 
them i) the tetrachoric correlation coefficient which depends on a four-fold table, | 
ii) the adjusted rank correlation coefficient which depends on assigning ranks to | 
the x and y observations. These and other estimates of the correlation co- 
efficient are discussed by Kendall [10]. 

We shall be concerned with the construction of some estimates of the cor- | 
relation coefficient which are particularly adapted for use with punched-card | 
equipment. A counting sorter is adequate for the first two cases discussed; 
in line with our previous development we shall then consider a technique which 
uses simple addition of the observed values, but does not require sums of squares | 
or products (in the special case where variances of x and y are equal). 
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5A. Estimation of p when means and standard deviations are known. Let 
us suppose that the means and variances of the variables x and y, distributed 
according to (20) are given, and consider the problem of estimating the cor- 
relation coefficient p from a sample of size n. There will be no generality lost 
by assuming a = b = 0, o; =o, =1. The technique used will be to construct 
lines y = 0, x = +k, which cut the zy-plane into six parts. We will form an 
estimate of p based upon the number of observations falling in the four corners. 
Figure 1 represents the lines laid out in the manner suggested in connection with 
a scatter diagram of 25 observations; naturally the method is recommended for 


n,=2 





oO 


n,=1 


| 
| 

o ° 
| 


Xm a-koy. x=a x= atko, 


Fic. 1. DIAGRAM OF THE CONSTRUCTION DESCRIBED IN PARAGRAPH 5A WITH A SAMPLE OF 
25 OBSERVATIONS SUPERIMPOSED 
use only with large samples, the 25 observations are for purposes of illustration 


only. More specifically after assigning the special values mentioned immedi- 
ately above to the means and variances in (20), we define 


n=[ [sever m=[ [ sewdedy 
(22) p=[ [ tevded, m=[ [sev aeay, 


Ps = [ Ric y) dx dy = [xe 0, 1) dz. 
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We denote by n, the number of observations falling into the region containing 
probability density p;. Of course > 312; =n. Now we may write the joint 
probability distribution of the n; as 


al J 
(23) g(m, Ne, N3, MN) = 7s. | II p?* . 


1 Mi! G21 
4 
remembering that n» = n — >, nj. 
1 


We shall now derive the maximum likelihood estimate of p from (23). Taking 
the logarithm of (23) we have 


5 
(24) log g = loge + >on; log Di 
t=1 
where c is the multinomial coefficient on the right of (23). Differentiating (24) 
with respect to p gives 
4 e 
dp i=l Di 


where pi; = cP * ; of course es = 0 because p; is functionally independent of p. 
p p 





To get p, the maximum likelihood estimate of p, under our restrictions, we must 
equate the right of (25) to zero and solve for p. Before proceeding it will be 
useful to note the following relations: 


Pi = Ps; P2 = Pa 
(26) ~Pi = —Ps ; Pr 


— ps; Pi = ps; P2 = Ps 
20 —k 

D+ Mm = / N(xz,0,1l)dz =A; pt p= [ N(z, 0, 1)dx = X. 
k r) 


If after making appropriate substitutions from (26) we set the right of (25) 
equal to zero we get 
Mp1 __ Mp MsPi Mar 
Pr A— pi Pi A— pi 





and since in general 7, ~ 0, the condition is that 


Ne + M h— pi 





(27) N + Ns —_ Pr 


Unless all four of the n; are zero (which is unlikely for reasonable values of \ 
because 7 is large), it is possible to find a value of p which will make the right 
side of (27) equal to the ratio formed from the observations on the left, and 
the value of p so determined is the maximum likelihood estimate p under the 
restrictions we have imposed. In practice this equation may be solved by con- 





I —_— cr 
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sulting a table of the bivariate normal distribution—see for example K. Pearson 
(14]. Alternatively [27] may be solved by referring to Figure 3. Truman Kelley 
(9, 1939] has considered a closely related problem in connection with the valida- 
tion of test items. 

It may be inquired whether it would not be preferable to reduce the present 
design to a tetrachoric case by using only the cutting lines x = 0, y = 0. An 
investigation of the variance of f reveals that such is not the case. We proceed 
to determine the asymptotic variance by means of the usual maximum likelihood 
technique. Differentiating (25) once more we have 








d'(log g) - ni(pi pi — pi) 
” a 2 ee’ ot Var | 
dp - VA?) 
where ji = ag . We note that E(n:;) = np;, therefore 
d' (log g) _ pi 
# (255°) - » Sa -E5) 


but since the derivative of a sum is equal to the sum of its derivatives, and 
pi + ps = A, P2 + ps = X, the first sum in the square brackets vanishes. Suit- 
able substitutions from (26) will reduce the second sum so that we get 





, | d (log a | Qnpir 
30 —E | 8 
= dp* pi(d — pr) 
Therefore asymptotically f is normally distributed with variance 
2¢2) _ P(A — Pr) 
(31) 70) = Oa 


In general the optimum value (in the minimum variance sense) of \ which deter- 
mines the cutting lines x = +k will depend on the true value of p. To carry 
out the minimization process in general will require fairly extensive computa- 
tions, which we feel would be justified. For the present we shall restrict our- 
selves to minimizing o°(p) for the case p = 0. 

We have 


— 1 = ue r 
Pi = 5- exp | 3k] = Vaz): 


when p = 0, and pi: = 3A. This gives 


wr 


(32) o(6|p =0) = ani f(y 


We wish to minimize the expression on the right. We recall that a similar 
expression )i/f; was to be minimized in section 3 when the optimum pair of 
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observations for estimating the mean of a normal distribution was found. 
Using the previous results we have \ = .2702, k = .6121; which gives us finally 


1.93 
(33) oon (5 |p = 0) = 


To summarize: if a sample of size n 1s drawn from a normal bivariate popula- 
tion with known means az, ay and variances o2 and o; , but unknown correlation p, the 
maximum likelihood estimate of p based on the number of observations falling in 
the four corners of the plane determined by the lines x = az + koz, y = ay ts found 
by solving for p the equation 


se 
Mm + M2 + M3 + M% SS 


where mn, 1s the number of observations falling in the upper right, nz in the upper 
left, ns in the lower left, ns in the lower right hand corner, and p; is the probability 
density in the region into which the n; fall, \ = pi + ps. The variance of this 
estimate p is given by 


o*(p) = pid — pr) 
Qndpr” 
which is minimized for p = 0 by setting k = .6121, X = .2702, giving 


es . 1.939 
Fopt (P| p = 0) = ——* 

On the other hand if the usual tetrachoric estimate is used with x = 0, y = 0 
as the cutting lines we get o7(p|p = 0) = 2 /4n. The relative efficiency of 
the tetrachoric compared with the optimum statistic is therefore .787. The 
variance of the efficient estimate r given in (25) when p = Ois1/n. Consequently 
the efficiency of our estimate p compared to that of ris about .515 for the special 
case p = 0 under consideration. This means about twice as large a sample is 
required to get the same precision with f as with r. Doubling the sample and 
using the cruder statistic may often be an economical procedure. 

It may be surmised that a still better estimate of p could be constructed by 
employing four cutting lines, say x = +k, y = +k. The simplifications which 
we used to obtain the estimate # no longer hold when we use this new construc- 
tion. However, it is still possible to compute the minimum variance of the 
new estimate which we will call f’, for the special case p = 0. It again turns 
out that k = .6121 minimizes and we get 


- , 1.52 
(34) copt(p’ | p = 0) = “-_ 
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which makes the efficiency of p’ (compared with r) about .66 as compared with 
515 for p. This suggests that if some very simple technique can be found for 
-f 


obtaining fp’, p’ would be worth using. Unfortunately the author has not 
been able to construct a rapid way of finding 9’. 


5B. Estimation of p when the parameters are unknown. A more practical 
situation than the case treated in paragraph 5A, is the case in which all param- 
eters of (20) are unknown. This case will be treated by means of order sta- 
tistics. We construct an order statistic analogue of the estimate p which we 
will call p*. In general the procedure will be as follows: Each of the N observa- 
tions in the sample has an x coordinate and a y coordinate 

i) order the observations with respect to the x coordinate; 

ii) discard all observations except the n with the largest x coordinates called 
the right set and the n with the smallest x coordinates called the left set, retain- 
ing, therefore, 2n observations; 

iii) order the pooled 2n observations with respect to the y coordinate; 

iv) break the 2n observations into two sets of n observations each; the upper 
set containing the n observations with the greatest y coordinates, and the 
lower set containing the n observations with the smallest y coordinates; 

v) reorder the upper set of observations with respect to the xz coordinate; the 
n Observations will be divided into those whose x coordinates belong to the 
right set and those whose x coordinates belong to the left set; 

vi) the estimate f* will be obtained by solving the equation 


n* * 
(35) i = .. 
on — ny Ai — Di 


*. ° . . 
where n; is the number-of observations in the wpper set which are also numbers 


of the right set and p; is [ / f(x, y)dx dy, while f(x, y) is the bivariate 
0 Jie 


normal (20) with c, = o,, = 1,a = b = 0, and / N(a, 0, 1) dx = x =r. 
ke 


Figure 2 represents graphically the construction described above for a scatter 
diagram composed of 25 observations. Of course the number 25 is only for 
purposes of illustration, as the method is only proposed for use with large samples. 

The procedure of ordering the x’s and choosing the right and left sets of ob- 
servations is analogous to cutting the bivariate distribution by the two lines 
xz = +k as described in paragraph 5A, indeed x = 2,4; and x = zy_, are the 
corresponding lines, but they vary from sample to sample. To continue the 
analogy, ordering the remaining observations with respect to y and dividing 
them into upper and lower sets of equal size is like cutting the plane with the 
line y = 0. Finally formula (35) is analogous to formula (27). Another similar 
change is that where formerly we had among relations (26) the equalities p; = 
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23, P2 = ps, We now have the corresponding relations amongst tl the number of 
observations in the four corners of the plane, namely nmi = 3, = Ns which 






n,* =2 n,"=4 


s né 
O nan, +n,= 


° 


n,*"=Ne2n-2 =11 


2 a s * 
n=n, +n; =6 X;=x Ryo=X Men, +n, =6 


Fic. 2. DIAGRAM OF THE CONSTRUCTION DESCRIBED IN PARAGRAPH 5B ON THE Basis oF 25 


OBSERVATIONS n = 68 


can readily be seen by inspection of the fourfold table we have constructed below 
(omitting all reference to N — 2n pairs of observations we have discarded). 








| Left set Right set Totals 
; * 
a a a ke eg | ma na 


| 
he ain belts da heey x alee | ne a ny 
: 
NS ee Ode ha cl ape onda | | 


We have dwelt at length upon the analogy between the two constructions 
because one of the principal difficulties in working with order statistics is to 
design a mathematically workable model. The author has found it fruitful 
when constructing systematic statistics to study a workable analogy which does 
not involve the order statistics directly, and then to build upon correspondences 


such as those described. 


2n 





of 
h 


Ww 
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Some may not wish to read further in this paragraph when they are informed 
that asymptotically the variance of p* is essentially the same as that of . They 
should proceed to page 404. For the others we proceed to the demonstration. 

Suppose we draw a sample of N pairs of observations (x, y;) from the bi- 
variate normal (20). If we discard from these all pairs except those with the 
n largest x; and the n smallest z; , we are left with the right set and the left set. 
We shall need the joint distribution of x,4; and ry_» 


N! ent . 
(36) J (tn41 ‘ lu—n) = (n!)(N — 2n — 2)! ([- g(x) iz) 


(| g(a) a) , (/ _ G2) ix) 9(Xn+1))9(tn—n)- 


where g(x) is the marginal distribution of x obtained from (20), N(x, a, a2). 
We assume 2n+1, Xv» Satisfy Condition 1. Considering 7,41, ty_, as fixed 
and given for the moment we wish to look at the distribution of the y coordinates. 
We may consider the y coordinates of the observations in the right set as drawn 
from the distribution of y 


[fa wae L Sle, y) de 
i) = Se. i intncreesee 


[ [ Se, y) dx dy “le dx 


Similarly the y coordinates of the observations belonging to the left set may be 
considered as independently drawn from 


[os v”) dx _ [Ct y) dx 
v'(y) = — = 


[ [ss = S(a, od dx a [ - g(x) dx . 


To prevent confusion, in oniiiedue the y order statistics of the two sets, we 
shall designate those of the observations which are members of the right set 
by U1, Ue, ***, Un ; While those observations belonging to the left set will have 
their ordered y coordinates designated 1; , v2, ---,Un. Of course the w’s and v’s 
separately satisfy an order relation like that given in (1). 

The first question we answer is: given 2,41, Yv-n, What is the probability 
that when we collate the u’s and v’s and split the observations into the upper 
set and lower set (see iv). there will be exactly c observations in the lower set 
whose y coordinates are designated by u’s? In other words what is the prob- 
ability that exactly c members of the lower set belong to the right set? An 
example for small values of n may clarify the problem. Suppose n = 4, and 
we observe uw < 01 < ve < U3 < Um < Us < 1% < uu; the y coordinates of 
the lower set of observations are U1, 01, V2, v3, and only the observation with uw 
for its y coordinate belongs to the right set, so for this casec = 1. To return 
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to our general problem, the probability that there are exactly c observations 
which are members of both the right set and the lower set is 

(37) Plc| angi, ty-n) = 1 — plac > Ue41) — P(Ue > Un—e41); 

where p(w > z) is the probability that w is greater than z. Now writing ¢(z) = 
[ ¢' (t)dt; ¥(z) = . y'(t)dt we may rewrite (37) as 


P(e | £041, Su~a) 
n! 


(38) - Lae | We I = VOTH Cre) de 


n! 


° qsotec lt. [V(vncoa) "11 = (teers) 'W/(On-er1) ne. 


After integrating the first integral of (38) by parts and simplifying we can rewrite 
(38) as 


P(e | Un+1 5 Ln—n) = 


yy Hera“ — ¥(ue+)I° 


soe _ 


n! 1 
Se ee nm—c 1 ae c— p 
+ (n = c)!(e _ 1)! (ue) ( « - 


We approximate the integral term of (39) by 


(39) 


ni! n—e el 
(n —c)i(e — 1)! [W(vers) — Wwe) (mers) ]” “TL — Y(uers)] 


which leads us to the approximation 


P(c | Ln+1, — 
(40) s—< c—1 
= woaie [y(u. +1)] [1 — ¥(Uc+1)] [1 + (c a I)P(tex1) — ey (u-)]. 


le! 


The joint distribution of u,, ue41 is given by 


Q(uc ’ Uc4+1 | XLn—n) 
(41) " n! 
~ (€= Iife —e — 1) 


Next we multiply P as given by (40) by Q from (41) and integrate out u.. This 


g(u.)~ (1 a P(Ue4 1))" "9" (Uc)e’ (Ue41)« 


gives us except for terms of O - and higher 


nin! 
(42) c!(n —c — l)!e!(n — c)! 


+ [L = o(ters)]" Ween)“ = W(ters) I 9’ (ues). 


[p(ue+s)]° 





oo 
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When expression (42) is multiplied by (36), we finally get the approximate joint 
distribution of c, 2n41, Lv—n y Uc+1 - 
Before proceeding further we let 


, [ f(x, y) dx 7 


=N-n 
pre 1 ja ts 


Uct+l Tn+1 
[- [Ose ae ay 
is > — = 





V(Uc+ 1) — 


In+1 zZN-n 
where Ai g(x)dzx, » = / g(x)dx. If we also let pi =l1- 
. — pi ‘ Do =i = Po we can «ite 
R(c, Xn41, Twn, Ue) 
_ Ky aes Ai a —2n—2 a" ot er pe ' "he", 


where the primes indicate derivatives of pa , Al, Av with respect to the appro- 
priate suppressed variables, u.41, %n41, Tv—n , respectively. 

We now proceed to the maximum likelihood estimate of p. Wetake the log- 
arithm of (44) and then take partial derivatives with respect to a the mean of 
z, b the mean of y, and p the correlation coefficient. After equating these 
partial derivatives to zero we have the following three maximum likelihood — 
tions which must be solved simultaneously to obtain the estimates 4*, b*, and p* 


as — dvi) 


(44) 


4 na—e- lop 
Az — AT da pr da 
oF +258 +58) ~0 
p: da ps oa ps da . 
* . * * 
146) if c—1épt , ¢ Op: , m— caps $m) = 0 


N 
(45) 


N pr db. pt ab ps ob pk ab 


= —- a 


pi dp pt ap pi dp ° pk ap 


* * 
(47) is Lopt 4, ¢ op: 4m — cap gm | = 0, 


N 
1 
where terms O e ) have been neglected. Equations (45) and (46) are satisfied, 


= , 
again except for terms O nN)? when 4* = 3(2n41 + ty—n), 6* = Uegi. Using 


this information we examine (47) and find it satisfied when 
* 

== Pi 
48 = + — 
( ) Cc My ce pi ’ 
which i is directly analogous to agution (27), and is the form promised in (35), 
ifn; =n—c. The estimate p* is obtained by solving (48) for p, where pr = 
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I / f(x, y) dx dy, and f(x, y) is given by (20) with variances equal unity and 
k* 


co 

* * ry 

means equal zero, and [ g(x) dx = = 1—r, = n/N. 
k* 


We shall not go through the derivation of o (p*) here. The usual maximum 
likelihood technique may be used. It turns out that the covariances between 


a* and p* and between b* and f* are O ( ) . Neglecting such terms we find that 


N? 
the variance is 
OK a * 
9 i 7 r —— ) 
49) 27a) pi (Mi Pi) 
(49 oF") QNrF pr? 


To summarize: if a sample of size N is drawn from a normal bivariate popula- 
tion with unknown parameters, the maximum likelihood estimate of p based on the 
2n observations composed of those observations with the n largest x coordinates and 
the n smallest x coordinates, may be obtained by solving for p the equation 

ia pr 


nm -—d®’ 
where } > X* = n/N > 0, pi = [ | f(x, y| o: = 1, a, = a, = 0) dz dy, 
0 JK 


N(a, 0, 1) dx = X*, and n — c ts the number of the 2n observations with 
k* 


largest y coordinates, which also have largest x coordinates. The variance of this 
estimate p* is given by 


o(s*) = pi(M — pr) 
ep) = ON BP” 


and for p = 0 the variance is minimized by choosing * = .2702, that is by choosing 
that 27 per cent of the observations with largest x coordinates, and that 27 per cent 
with smallest x coordinates, and for this value of d* 


i . , 1.939 
o opt (6* |p = 0) + —-. 


Equation (49) is of course exactly analogous to the expression given in (31) 
for the case of known means and variances. Therefore if the variance minimiza- 
tion problem is solved in general for the case of paragraph 5A, the large sample 
solution of the problem for unknown means and variances will also be solved. 

Figure 3 may be used to obtain the estimates f or p* in case the methods of 
paragraphs 5A or 5B are used. Essentially the figure solves equations (27) 
and (48). The procedure for the problem of paragraph 5A is 

m + Ns 


i) when n; + n3 > no + 4 evaluate the ratio —————————- = & and 


ny + M+ 3+ M4 
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find the intersection of the line x = x» with the curve for the particular \ being 
used ; . 

ii) through the point of intersection of the vertical line 7 = x and,the A 
curve draw a horizontal line; 

iii) the value of f is indicated on the vertical axis at the point of intersection 
of the horizontal line and the vertical axis; 











° 
w 








°o 
es 





ESTIMATE OF 7 





Fig. 3. Curves ror EsTIMATING THE CORRELATION COEFFICIENT p 


Ne + M4 
M1 + Ne + Ng + M5 
the same procedure, f will be the negative of the number appearing on the 
vertical axis. 
Example. Suppose a sample of 1000 is drawn from a normal bivariate popula- 
tion for which the mean of z is a, and the mean of y is b, and the variance of 
x is o; , all three parameters known (it is not necessary to know o3). The zy 


iv) when n; + n3 < me + ns use the ratio x) = and follow 
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plane is cut by the three lines x = a + koz, y = a, where, say, k = .612, so 
that \ = .27. Suppose we find the observations are distributed as follows: 


in the upper right-hand corner: 160 = m 
in the lower left-hand corner: 170 = nz 

in the upper left-hand corner: 110 = ne 
in the lower right-hand corner: 110 = m,. 


To estimate p we set up x = (m: + ms)/(n1 + Ne + 3 + m4) = 330/550 = 6. 
Referring to Figure 3 we find that the estimate of p, p = .20. 
In using Figure 3 for this case it is useful to know that for 


A= 50 k = 0.000 A= .27 k = 0.612 
A= 40 k = 0.253 A= .20 k = 0.841 
A= 30 k = 0.524 A= .10 k = 1.282 


If the means and variances of the variables are unknown, we may use the . 
method of paragraph 5B: 

i) when n — c > c evaluate the ratio (n — c)/n = 2x, and find the inter- 
section of the line x = 2 with the curve for the particular \, being used; 

ii) through the point of intersection of the vertical line x = x and the ), 
curve draw a horizontal line; 

iii) the value of f* is indicated on the vertical axis at the point of intersection 
of the horizontal line and the vertical axis; 

iv) when n — c < ¢, use the ratio c/n = 2 and follow the same procedure, 
p* will be the negative of the number appearing on the vertical axis. 

Example: Suppose a sample of 1000 is drawn from a normal bivariate popu- 
lation with all parameters unknown. Suppose we set n = 200, and follow 
the procedure given in paragraph 5B of this section, and suppose we find the 
observations are distributed as follows: 


in the upper right-hand corner: 50 = n — c 
then of course 

in the lower left-hand corner: 50 = n — c 

in the upper left-hand corner: 150 = c 


in the lower right-hand corner: 150 = c 


The estimate this time is clearly negative, so we set 2% = c/n = 150/200 = .75. 
Referring to Figure 3 we find using the’ curve corresponding to \ = .20 that 
the estimate of p, p = —.44. 


5C. The use of averages for estimating p when the variance ratio is known. 
Nair and Shrivastava [12, 1942] have considered the use of means for estimating 
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regression coefficients when one observation is taken at each of n equally spaced 
fixed variates, x; (2 = 1, 2, ---, n), and y is normally distributed. Their pro- 
cedure was essentially to consider the ordered fixed variates, and to discard a 
group of observations in the interior, much as we discarded the set of observa- 
vations whose x coordinates were 2n41, 2n42, ***, v—-n in paragraph 5B. The 
resulting estimates depended essentially on the averages of the y’s on the right 
and left sets of observations, and on the averages of the fixed x’s in the two 
sets. 

In an unpublished manuscript George Brown has considered a problem even 
more closely related to the one considered in paragraph 5A. Suppose x and y 
normally distributed according to (20) with equal variances o’, and means 
equal to zero. (The ratio of variances must be known, equality is unnecessary.) 
Retain only those observations for which | x;| > ke, and from them form the 
statistic 


_ % — §- 
(50) PB ie i. ’ 


where 7, and < are the average of the n; 2’s and y’s for which x; > ko and 
g- and Z_ are similarly defined for the n2 observations for which 1; < —ke. 
Then pg is an unbiased estimate of p. Regarding the x’s as fixed variates it 
turns out that 


1 — p*)o? f1 1 
(51) o’ (ps) = _— E ~ > 


(4, — <.. ny Ne 


If we approximate by substituting expected values for observed values (55) 
—k 
turns out to be (1 — p’)o’A/2N[g(k)]’, where \ = | g(x) dz, g(x) = N(z, 


0,1). The value of k which minimizes this expression is our old friend k = .6121, 
which gives \ = .2702. Therefore for p = 0 and large samples, the minimum 
variance is approximately 1.23 o°/N, for an efficiency of about .81. The relative 
efficiency of the methods of paragraphs 5A and 5B are .635 compared with the 
present technique. 

We presume that the analogous order statistics construction would produce 
much the same result. Our interest in the present technique is to supply an 
approximate answer to the question of what is to be gained by going from the 
counting technique proposed in paragraph 5B to the next level of computa- 
tional difficulty—addition. 
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THE NON-CENTRAL WISHART DISTRIBUTION AND CERTAIN 
PROBLEMS OF MULTIVARIATE STATISTICS! 


By T. W. ANDERSON 
Cowles Commission for Research in Economics 


1. Summary. The non-central Wishart distribution is the joint distribution 
of the sums of squares and cross-products of the deviations from the sample 
means when the observations arise from a set of normal multivariate populations 
with constant covariance matrix but expected values that vary from observation 
to observation. The characteristic function for this distribution is obtained 
from the distribution of the observations (Theorem 1). By using the char- 
acteristic functions it is shown that the convolution of several non-central 
Wishart distributions is another non-central Wishart distribution (Theorem 2). 
A simple integral representation of the distribution in the general case is given 
(Theorem 3). The integrand is a function of the roots of a determinantal equa- 
tion involving the matrix of sums of squares and cross-products of deviations 
of observations and the matrix of sums of squares and cross-products of devia- 
tions of corresponding expected values. 

The knowledge of the non-central Wishart distribution i is applied to two gen- 
eral problems of multivariate normal statistics. The moments of the gen- 
eralized variance, which is the determinant of sums of squares and cross-products 
multiplied by a constant, are given for the cases of the expected values of the 
variates lying on a line (Theorem 4) and lying on a plane (Theorem 5). The 
likelihood ratio criterion for testing linear hypotheses can be expressed as the 
ratio of two determinants or as a symmetric function of the roots of a deter- 
minantal equation. In either case there is involved a matrix having a Wishart 
distribution and another matrix independently distributed such that the sum 
of these two matrices has a non-central Wishart distribution. When the null 
hypothesis is not true the moments of this criterion are given in the non-central 
planar case (Theorem 6). 

2. Introduction. The well-known Wishart distribution is the distribution of 
the sums of squares and cross-products of deviations from the sample 
means of observations from a multivariate normal distribution. If the 
expected values of the variates change from observation to observation 
(with the covariance matrix constant), the distribution of sums of squares and 
cross-products is the non-central Wishart distribution. This distribution has 
been given explicitly [1] for the simple cases of the non-central problem. If we 
think of the expected values of each observation as defining a point in a space of 
dimensionality equal to the number of variates, we can say that the cases 
handled are those in which the points corresponding to a sample lie on a line or 


1 Part of a thesis submitted to the Mathematics Department of Princeton University in 
partial fulfillment of the requirements for the degree of Doctor of Philosophy, June, 1945. 
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a plane. Although the explicit formulas for the distribution of higher rank are 
extremely complicated and have not been derived, the characteristic function 
is relatively simple. The distribution in general can be given in terms of a 
simple multiple integral. 

The Wishart distribution is the basis of much of the sampling theory associ- 
ated with the multivariate normal distribution. It plays a role similar to that 
of the x’-distribution in univariate normal theory. It can be used in deriving 
the distributions of the generalized T” and of the multiple correlation coefficient 
when all variates have a normal distribution; it is used in deriving the moments 
of the likelihood ratio criterion for testing the general linear hypothesis (including 
the test of the means of several populations being equal) as well as deriving the 
moments of other such criteria’). For the problems of the T” and the test of 
the linear hypothesis and many other problems, the non-central Wishart dis- 
tribution must be substituted for the central Wishart distribution when the 
null hypothesis is not true. That is, the non-central distribution can be the 
basis of obtaining the power function for many tests in multivariate normal 
statistics. As an example of the application of the non-central Wishart dis- 
tribution to these problems, in this paper we obtain the moments of the gen- 
eralized variance and the moments of the criterion for linear hypotheses when 
the population means lie on a line or a plane. Applications to other problems 
such as testing collinearity, comparing scales of measurement, and multiple 
regression in time series analysis will be published in a later paper [3]. Another 
problem to which this non-central theory can be applied is a method of estimat- 
ing the parameters of a single equation of a complete system of linear stochastic 
difference equations (developed by T. W. Anderson, M. A. Girshick and H. 
Rubin). 

In [1] it was shown that one can make linear transformations on the observa- 
tions which simplify the derivation of the non-central Wishart distribution in 
the linear and planar cases. Consider a set of N multivariate normal popula- 


tions, each of p variates. Let the i-th (¢ = 1, 2, ---, p) variate of the a-th 
(a = 1, 2, ---, N) population be x;. ; let the mean of the variate be 
(1) E(ia) = Mia (¢ = 1,2,---,pja=1,2,---,N). 
Let the covariance matrix (of rank p) common to all N populations be 

|| (tia — pia)(Lja — Bja) || = || oc; ]| 
° (a = 1, 2, ---, N). 
The probability element of the z;. can be written as 
(2) fo? | (2x) exp [— § Deo” (ia — mia) (ia — mind ITT dec, 
where 

lho"? |] = [loss ll. 


2 See Wilks [2] for example. 














THE NON-CENTRAL WISHART DISTRIBUTION 411 


The sum of squares and cross-products of deviations from the means in a sample 
{Zia} are 


N 
(3) a; = Du (wie — 2)(Zjn — 2), 
where 
N 
Zz; = wl Lia 


The dimensionality, say t, of the space spanned by || wie || is equal to the rank of 
N 


(4) Il ill = || 22 Guiw = Bi)(uia — Bi) II, 
where ° 
1 N 
Kia = we, Mia- 


As a result of a linear transformation it was demonstrated that the distribution 
N-1 


of a;; is the same as that of >> x;42ja Where the z;. have a normal multi- 
a=1 
variate distribution with covariance matrix || o;;|| and expected values 
E(tia) = ia 


(¢=1,2,---,pja =1,2,---,N —1), 
such that 


N—1 ; 
| rai || = | 2U Mie |. 
The joint distribution of a;; is given for three cases: 
(i) Caset = 0: 
(5) W(ai; 083, T1373 P» N — 1,0) = Ko| 0? |?*™ | a; |?” exp [— td o” ais); 
t7 


(ii) Case t = 1: 
(6) Was, 03,7873 P,N — 1,1) = Kiexp[—3 20 6" ri] | 0” mr 


t52 
X exp |— 4c” a;;] [Daizgri Iya (V Dai 781); 
1,2 t,7 4.2 
(iii) Case t = 2: 
W(ai;, 045, 7143 3 P) N — 1,2) = Keexp(— 3 Doo” rid lo? PO 
42 


__ | AON= p—2) = ij = we 
X lass exp(— 42% aul De 0 oirain — 2] + w) 


=— ( — Ww ee 
X (ui + wm) _— Th (w—3)42w (Ju + ue); 
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where 


Pp 
= ae gl TT GIN — 4), 
t=1 


p—1 
es gin -1)—}(N—3) ri?) TT ra[n en il), 
i=1 


p—2 
a girin -1) -}(N—3) Pe) IL r(3[N = Baw il), 
i=1 


I,,(z) is the Bessel function of purely imaginary argument, and wu; and w are 
the two non-zero roots of 


(8) |\T —r»A*] =0 


(here T = || 7,;|| and A = ||a;;||). The number N — 1 is the number of 
degrees of freedom and t is the rank. The matrix || ;; || we shall call the sigma 
matrix, and || 7;;|| we shall call the means sigma matriz. 

Let «i, K2, -**, >» be the real, non-negative roots of the determinantal equation 


(9) |T —rAz| =0 


(where © = ||o;;||). There is a non-singular p X p matrix ¥ (= || yi; |] ) 
such that 


(10) vrv’ = / 
and 
(11) WTW'’ = || «i6;; || 


(where J is the identity; W’ is the transpose of ¥ and 6;; = 1 for 7 = j and 0 for 
t #7). Then the quantities 


Pp 


(12) bi; = Do Vin Vin One 
heal 


have the distribution W(bi;, 5:;, «i5:; ; p, n, t) where n = N — landx«; = 0 
forz = ¢+ 1,¢-+ 2, ---, p). This is the same distribution that would be 
derived if the b;; were defined by 


(13) b3; oo a Yia Yia 5 


where the distribution of the yia is 
pn 


(14) (2) *”” exp [— +2 a (Yia — Ki 8ia)]*. 


This simplified distribution of the observations has been called the canonical 
form. 
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3. The characteristic function of the non-central Wishart distribution. We 
shall find the characteristic function of the a;; and 2a;; (¢ # j) as defined in (3). 
We first obtain the characteristic function of the b;; and 2b;; (¢ ¥ 7) as defined 
in (13) and then perform a linear transformation to obtain the characteristic 
function of the a;;. The characteristic function of the b;; and 2b;; (¢ ¥ j) 
is defined as 


(15) B( exp | é 3, bi8e )) 


where 
0.3 = 05 


and 7 in the exponent is the imaginary quantity. 
We can write (15) as 


H(exp| | x es Yia Yia my 


= (27) ~ vf exp| - +> Gia = % Sia) +2 = Y Yiu Vind a; | 


t=l1 a=1 i,j=1 a=1 
1) 
x ITT ayia. 
t=1 a=1 


Let us first integrate the y,. fori = 1,2,---,;panda=t+1,t+2,---,n 
that is, make the integration 


or —_— | ” 


-exp | 5 3. a Via +¢ > S yea tin0 os [TT II dYia- 


1,j=1 a=l =1 a=t+1 


This is, however, the characteristic function of a Wishart distribution with 
n — t degrees of freedom [4], namely 


(16) | 8:3 — 260,; (4%. 


Now we must make the integration 


“ 1 t +00 +00 
(2n)4 exp| - 52% al f ve fe 
ep | EL + § XL, Dvavaty + Domne] XU a. 


t=] n=1 


(17) 


There is a p X p matrix G = || g;; || such that 


Pp 


Zz Gri den Ge; = 5:3, 
bal 
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where 
din = ben — 22Okn. 


Let us make the transformation 


Pp 
,= a Gin Znq + "ky, 


where 
|| a™ || = || dia [I 
Then the exponent of (17) within the integral sign is 


“HE DA Darah, 


t=1 y=1 
and the Jacobian of the transformation is 
| din . 
Hence, the integral of (17) is 


t t 
(18) | d” |** exp | -} e G—- Dd aaa) |. 
2 \q=1 7=1 

This result is obviously true if the @;; are pure imaginary and sufficiently small 
so that || d:;|| = || 6:; — 220,; || (which is real in this case) is positive definite. 
For all complex 6;; in a neighborhood of the origin (17) converges because the 
real part of || d;; || is positive definite. Similarly the integral of the derivative 
with respect to 6;; of the integrand converges for @;; in this neighborhood. It 
follows that the (complex) derivative of the characteristic function exists in this 
neighborhood because the derivative of the integrand is measurable and is abso- 
lutely integrable. Therefore, the characteristic function is analytic in a neigh- 
borhood of the origin. From this it follows that the characteristic function is 
analytic in an open set containing the flat space of real 6;;._ By analytic con- 
tinuation, then, (18) is the value of (17) in the open set containing real 6;;. The 
characteristic function (15) is the product of (16) and (18). Accordingly, we 
have the result that the characteristic function of the b;;and 2b; ; (¢ #7) defined 
by (13) is 


t t 
(19) |a*" exp| -3( Xd - Dard). 
Z n=1 n=1 


It is clear that if x, = 0 (for all y), this function reduces to the characteristic 
function of the Wishart distribution with n degrees of freedom, namely, 
(20) | 6:5 = 270; ; =, 


It is interesting to note that (19) factors into two parts, one of which is (20) 
and the other is 


a vo[-a(Ee- Eee), 
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The distribution function similarly factors into two parts, one of which is the 
Wishart distribution, whose characteristic function is (20). Thus the non- 
central Wishart distribution function is the convolution of a function (central 
Wishart distribution) and another (the transform of (21) the first of which is 
a factor of this same non-central Wishart distribution. 
In the planar case the characteristic function can be written as 

exp [—3(«i — d'«})] exp [—3 (x2 — d”x3)] 


| 6 — 270;; | | 63; — 270,;|m2’ 





where 71 + m2 = n. From this fact it is clear that the distribution for the 
planar case (if n > 2p + 2) is a convolution of two distributions each of the 
linear case. 
This deduction can also be made from the distribution (14). Let 
nit+l 


b;; = Ya Yj + a YiaYia » 


bi = ya¥e2 t+ LD YiaYia- 

a=n,+2 
Then it is clear that the b;; has the non-central Wishart distribution with n 
degrees of freedom and parameter «x; in the direction of the first coordinate 
axis, while the b;; has the non-central Wishart distribution with ne degrees of 
freedom and parameter «> in the direction of the second coordinate axis. Since 


bis = bi; + b5;, 


the distribution of the b;; is a convolution of the distributions of b;; and b;; . 
In general the non-central distribution is the convolution of ¢ distributions of 
the linear case (provided n > tp+ i). 

It is easy to show that if one has two (or more) non-central Wishart dis- 
tributions of rank 1 with parameters in the same direction, the convolution is 
again a non-central Wishart distribution with parameter in the same direction. 
Suppose b;; and b;; have non-central Wishart distributions with parameter «’j 

72 


and «x’’; in the direction of the first coordinate axes and n; and nm, degrees of 
freedom respectively. The characteristic functions are 


| a?" exp [—3(ar" — dai] 
and 
| d"? )"exp [—3(a — dy’). 
The product is 
| a” exp [—3(«r — a" ei)], 
where n = mn + mand xj = x’ + x4. 


Now let us deduce the characteristic function of the a;; and 2a;; (i # j). 


P . ° 
Since by (12) the b’s are transforms of the a’s we can write a;; = > vi y* bis. 
hk= 
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Then 


(22) E (exp 3 z Qi; 6s) =E (exp E > $i; iy y* bw |), 


where ¢:; = ¢;:. If we define 


P. ° ° 
(23) One = a, div, 
1,j7= 
then (22) can be derived by substituting (23) in (19). 
Let 
® = || ¢.;|I- 
Then 


ld; || = D=w'"'(a>" — rev" 
and 
D™ = WV(=" — 2) 'w’. 
The characteristic function of the a’s then can be written as 
exp [—}{ir(WTW’) — trl¥(z" — 216) W’ VT Y'I}] 


{|w’| . i} — Qib| . | w-]}in 








using (10) and (11). The denominator is 
{ie} -leir" iz" -— ae 
and the numerator can be written as° 
exp [—3{tr (M’WYM) — tr[M’W'Y(=" — 26) W’¥M}}] 
where 


= || Mia — Bi || 
and 
M'M = T. 


We may summarize in the following theorem: 

THEOREM 1. Given aj; (2,7 = 1, 2, ---, p) defined by (3) where the xe (i = 
1, 2, ---, pj; a = 1, 2, ---, N) are distributed according to (2), the characteristic 
function of a;; and 2a;; (¢ # Jj) is 


(oof $, a0) 


- sane ig ‘ ; 
_ |e AD » | - 5. > You — Hue — 2) | 
i 


o [sto—2) 


* exp : 7 . o 65530" (une — Hn) (Uke — A) ’ 


(24) 


7=1 a=1 
h,k=1 


3 The result follows from the fact that tr(AB) = tr(BA). 
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where 
I| 5 |" = |] eo? |] = |] 0% — 2i6.5 || 
and 
Gij = Oi - 


Suppose we have two sets of quantities a;; and a,; each set of which is dis- 
tributed according to a non-central Wishart distribution with sigma matrix 
|| o’ ||, one having n’ degrees of freedom (or n’’), means sigma matrix 7;; (or 
7;;) of rank ¢’ (or ¢”). Consideration of the characteristic functions (24) shows 
that 


, a 
Gis = Giz + Qi; 


has a non-central Wishart distribution with matrix || o*’ ||, n’ + n” degrees of 
freedom and a matrix 


Il res Ul = Wl res IL + Ul ves Il. 
The rank of the distribution is equal to the rank of \| 71; ||. This result can 
also be deduced from the representation of a;; and ai; in terms of observations 
from non-central normal populations. It is a straightforward generalization 
of the same result for central Wishart distributions. 
THEOREM 2. The convolution of two or more non-central Wishart distributions 


with identical: sigma matrices is a non-central Wishart distribution with means 
sigma matrix equal to the sum of the means sigma matrices of the components. 


4, An integral representation of the non-central Wishart distribution in the 
general case. It was shown in [1] that 


Wiz, 5:5, «45:55 D, m, t) = Co? / jn FO re 
where 


Pp 
co st efit (a'E) ier tp(p—l)+hpt II r(4[n wih il), 
t=1 


dB = {I [I a,,, 
t=] j=i 
Pp t 
dY = II II dys, 
t=1 q=1 
B = || bs |l, 
Y = || vig || (yn = 1,2, ---, 2d), 
K = || Kidiq || (yn = 1, 2, ---, 4), 


and the integration is on Y over the range || B — YY’ || positive semi-definite. 
This is equivalent to 
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(25) Ce"? | B peenern / ll-yY'B'Y Ma—p-t—D) ptriKry) dY. 


The integration is over the range of Y for which || J — Y’B™'Y || is positive 
semi-definite. 
There is a p by p matrix H = || h;; || such that 


H’B"H =1 
H’K = W = || wade ||, 
where w% are the roots of 
| Kis; = Ab* | _ 0, 
[|b |] = |] bs5 |. 
Then make the transformation to Z = || zi, || by 
Y = HZ. 
The Jacobian of the transformation is 
|H | =| BI. 
Then (25) can be written as 
(26) Ce** | B Ss / | I wal ZZ poe eft s) dZ. 


Partition 
| Z' \| 
Z= 
Zi 
such that Z,; is square (¢ X é). Let I — Z;Z, = E’E, (in terms of Z;), where 
E is specified uniquely and consider the transformation of variables from Z, 
to V defined by 
Z, = VE. 


Then (26) can be written as 


Ce? | B ‘iii aB | 1 _ sa." ow dZ, 


/ lI — Viv po av, 
where 
Wi — || Wydne | (n, é _ 5, 2, =a . 


The first integration is over the range (J — Z;Z;) positive semi-definite and the 
second is over (I — V’V) positive semi-definite. The value of the second 
integral is 
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p—t 
ri? TT r(aln — 24+ 1 — i) 
i=1 


| |Z om Hy pene dV = y 
I] rGin —¢ + 1-4) 


Hence (26) can be written as 
(27) C, ets | B pram / | I ee s Za —— eft Wiz) dZ; 


with 


Cc; = elt(K'R) oben top +40? 


28 t p—t 
” x [rGm —¢+1— 4a) [1 TGin —t+1— 4). 


The first part of (27) is, except for a constant factor, a central Wishart distri- 
bution with n degrees of freedom. The integral of the second part is obviously 
a symmetric function of the w;. In terms of the a;; the w; are simply the 
roots of (8). Wecansum these results in a theorem. 

THEOREM 3. Given a sample of observations {xia} (¢ = 1, 2, ---, p3a = 1, 2, 
-++, N) distributed according to (2), the probability density function of the sums 
of squares and cross products of deviations from the sample means defined by (3) is 


W(ai;, 0:3, 74337, N —1,0 =CGi| o” ss | as; ‘lili 


ln y é 
-exp| -} Dov au | f Ong = Do Zin2it 
i,j = 


4(N—2t—2) 


t 


t 
* exp ay Wi Zit II dat 
= 


ngé= 


integrated over 
t 
bat — Dy Zin Ziz 
t=1 


positive semi-definite where C; (n = N — 1) and 7;; are defined by (28) and (4), 
respectively, and where w;, are the t non-zero roots of (8). 


5. The moments of the generalized variance in the linear and planar non- 
central cases. 

5.1. The linear case. The generalized variance, which is the determinant of 
the variances and covariances,’ is a measure of the spread of the observations. 
If one thinks of the N observations of each variate as a vector in N-space with 


* This definition of Wilks [5] was made in terms of variances and covariances defined by 
a:;/N (from equation (3)). Since we consider a;;/(N-1) to be the variances and covariances 
we define | a;;/(N-1) | as the generalized variance. 
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origin at the sample mean, the generalized variance is proportional to the square 
of the volume of the p dimensional parallelotope which is defined by these 
vectors as principal edges. Another geometric interpretation can be given in 
terms of the p-dimensional variate space. The generalized variance is pro- 
portional to the sum of the squared volumes of all possible parallelotopes that 
can be joined by choosing as the p principal edges p of the N sample vectors 
(origin at the sample mean). 

In this section we consider the moments of the generalized variance when the 
distributions of the observations are non-central multivariate normal. In 
terms of the first geometric representation this means that the center of one or 
more of the vector distributions is different from the others. For convenience 
we shall assume that the distribution of the observations {y;.} is according to 
(14). This will give as much generality as if we treated observations {2;q} 
having the distribution (2). Moreover, we shall consider the determinant of 
sums of squares and cross-products instead of the determinant of variances and 


covariances. It is clear that the determinantant | b;;|, defined by (13), is 
(N — 1)”) of | ge | , defined by (3). 


Let us first consider the linear case, i.e., x = x, ¥ Oand x; = 0 (2 = 2, ---, p) 
in (14). The first of the p vectors is centered on the first coordinate axis, not 
at the origin. Then the probability density function of the b;; is 


p 
et | b:; pee exp | 3 2 bs: | oo (x’bu1)* 


i=] 

- Geel (4n + a)” 
oipn _tp(p—)) IT r(3[n ~ il) e - (3 + @) 
i=1 














simply a multiple (by | = 











(29) 








We wish to find the moments E( | ;;{"). Let 





b3; = Si8;'ij.- 
Then s; is the sum of squares of the i-th variate and || 1;;|| is the matrix of 
sample correlation coefficients. The Jacobian of this transformation (to sj, 
Tj) is 
(3)8-Y (92) 8 P- BD oe (s° yee» 
2 Pp . 
The probability element of the s’s and 7’s is 


exp | -a —} - “| Il (si) 


ian] —p—1 
a -_ | rij ~ p—1) 





p—l1 
(30) gipn tp(p—)) I] r(a{n _ i}) 


xo" __ pach TT Th ara. 


St 2alP(4n + a) tA iol jort+l 
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It is clear from (30) that the s; are distributed independently and that the set 
r;; have a joint distribution independent of the s?’s. Hence 


E(\ bi; I) = E(| 8; 8; ri; | I") - II E{(si)']- -E(| ri; | >. 


The probability element of s} (i = 2,3, ---, p) is 
e Mi (g?)in 
“PT Gn) 

which is simply the x’-distribution. The h-th moment of s? (¢ = 2,3, ---, p)is 


E((s)"| = 2 wth = 


d(si), 


The probability element of s; is 


—hx2 / 2\3n—-1 —he? w 

e ™ (81) é (Cs si)* 2 
(31) Qin z ea IT(an + ao 
This is the x’-distribution (non-central x’-distribution) which was given by 
Fisher [6]. Applying term-by-term integration (the series converges properly) 
we get the h-th moment 


0 2\a 
ray) oh te? (x’)"T (gn + h + a) 
BU(si)'] = Be 2 arn +a) 
The probability element of the r;; is the well known distribution of correlation 
coefficients, 


T?—(4n) |r fer) 2» 

stl p—l ™ Il, dr;;. 

=] ji 

er I] r(3[n ais i}) t=] ji+ 
= 


i | 153 jMa-e—) Pp Il ’ Ul T'(3ln — 4) 
rz = 


pte) jaa T?-1($n) : 


where the integration is over the entire (permissable) range o the r;;, we have 
as a consequence the h-th moment of the determinant (since n is arbitrary) 


__ Ten) | i | Ti tere ee P P 


E(\ r5 |") = 


— oy J IT dr; ; 
1 


=] j=i+ 


0 r(4{n — ¢]) 
0 rain — a] + hr? (an) 
I] rain - ayre"an + 
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Hence, the h-th moment of | s;sjr;; | is 


a p Yrdet- Os erga at 


il T(4[n — 7]) ano «2a lT'(4n + a) 


Let us summarize this in a theorem for the a;;. 

TuHeorREM 4. Jf the quantities a;; (i, 7 = 1, 2, ---, p) have the distribution 
W(ai;, 0:5, ti; 3p, N — 1, 1) defined by (6), then the moments of | a; ; | are given by 
p—l 


B(\ ax) = jo" partes EEG 1AM = ergy — 1+ hte) 


e amo 2all(3[N — 1 : 
TI raw -1-@ 0 Wall (5 ] + a) 
where x is the non-zero root of (9). 
The h-th moment of the generalized variance | a;;/(N — 1) | is obtained by 
dividing the above expression by (NV — 1)”. 
If x’ = 0, expression (32) clearly reduces to the moment given by Wilks [5] 






























[r@m+1-a+n) 
(33) gre ints 
I] r(a[n + 1 — iJ) 
















The expression (32) gives the moments of the generalized variance when the 
means of the observations are not fixed, but lie on a line. The distribution of 
| b;; | is not a simple function even in the central case. However, in any par- 
ticular case one could find the first few moments of | b;; | and fit a distribution 
function. It is to be noted that the convergence of the series is nearly as rapid 
as that for e**” 

5.2. The planar case. Next we shall treat the planar case for two dimensions. 
Suppose that «; ~ 0 (i = 1, 2). The probability density function of bu, bis, 
and boo is 


2 
exp | ~166 + x3) — 3 > bs | ita 

(34) a (bi: beg — biz) 

x 7 [xi K3(bu bee — bi2)]*(xi bu + Ke bes)? . 

ap—o 2+ lB IN(Z[n — 1] + a)P(gn + 2a + B) 
Let bi = 8}, bee = sz, and by = si%r. The Jacobian is s\s.. The probability 
element of sj , s; and r is 
oe dehteD 


PVE 


(si “ey” “a = pie -Gttep 





aed xy (Kasi si)" = 7)"(ci st + 1 99)? (si) d(@e) dr 


apo 2+8olBIT(4[n — 1] + a)P(gn + 2a + B) 
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We wish to find E{[sjs3(1 — 7°)]'}. Let us first multiply (35) by (1 — 7°)" 
and integrate from —1to+1. We then obtain 


28a a eet a 
2"e A(xF+ 2)(s?)!™ *(s2)" . 4(s?+83) 


» (ki Ko $1 82) "(ki 81 + ke 82) T'(3[n — 1] + h + a) d(s;) d(s5) 
apo 2°*+%qIBIT(4[n — 1] + a)P(4n + 2a + B)T(4n +h+ a)” 
Next we multiply by (s7)"(s3)", set («isi + x2s2)°/B! equal to 
(xi 8i)°" (x3 83)" 
8,+B2=8 Bi !B2! , 


and integrate sj and s; from 0 to ©. We obtain 


E([birbe2 — vie]") 


(36) = 2” exp [— 4(xi + «3)] 
afr p2—0 Qt +82 018.1 8.Ir(a[n — 1] + a) (dn + 2a + Bi + fe) 


r(ia[n — 1] +h+a) 
T(4n + h + a) 
which is the expected value we are seeking. 

Clearly this reduces to a special case of (32) if x2 is set equal to zero. 

Now we consider the planar case in p dimensions. Geometrically we have 
p vectors in n-space. If the {yi.} are distributed according to (14) the mean 
point (i.e., center of distribution) of the first two vectors is different from the 
origin, but the mean point of each of the other p — 2 vectors is the origin. 
The vectors are distributed independently. The determinant 


|b; | = | :: Yia Yia 


is the square of the volume of the parallelopiped which can be expressed as 
Vie *** Vp Sin 6, Sin 4 +++ sin Op4, 


where v; is the length of the 7-th vector and 6; is the angle between the (2 + 1)-st 
vector and the flat space determined by the first 7 vectors. The distribution of 
U3, ‘**, Vp and 6, ---+, 0,1 is statistically independent of v, , v2, and 4; ; for 
no matter what the plane of the first two vectors is, the conditional distribution 
of the other variables is the same. Hence 


E( | bi; |") = E[(vywe sin 6,)""] . E[(vyv4 st sin O. +*° sin 6,-1)”). 
If the y’s had simply the distribution 
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1 A 
——_—. oe ae . 
(37) (2m)*?” exp| 2 y a ve | ’ 
then the h-th moment of | b;; | would be (33), and the h-th moment of 
bu by 


bie bee 


- + 
V1 V2 SIN 6; => 















would be 


2 
on I] r([n +1 —i +h) 
: 


II r@m4+1—-% 


t=1 














Since the distribution of v3 , vs, ---, vp and @, ---, 0-1 is the same whether the 
y’s are distributed according to (14) or (37), we have 





rain + 2h + 1 — 7) 


(38) E(vs -- + vp sin & «++ sin 0,-4)"] = gh(r—2) = 


I T(3[n + 1 — 7) 














Multiplying (36) by (38) we obtain the h-th moment of | b;; |, namely, 


II rGln + 2h + 1 - 4) 
E( | bi; |") = 2”? exp [— 4(«i + «2)]) = ———______ 


Iran +1- 4) 





Sx (i) *PH(ag) Tn —h + a + By) 
a ,B1,B2=0 re a!B,! Be! r(3[n —1)+ a) 


Tign t+h+t+a+t &) Gln — 1]+h + a) 
T'(gn + 2a + Bi + Bo) (9n + h + a) 
This result may be summarized as follows: 


THEOREM 5. Let the probability density function of the quantities a;; (i, 7 = 
1, 2, ---, p) be 






W(ai;, 0:5, 7153p, N — 1, 2) 

defined by (7). Then the h-th moment of | a,; | is 
Pp 
; II raw — a) +”) 
B(\ auf) = [of 2 exp (— 3d +.) 
ITraiw — 4) 
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go) ye ADE) HPGIN = 1+ h+a+ BEGIN — 1+h+ a+ Br) 
~ pats 1B 1B TQIN — 214+ arGIN —1l4 2e4+6,+ 24.) 
a6;f2=0 2 a! 81! B2!P(GIN — 2] + a) PIN — 1] + 2a + fi + f:) 


r(ZIN — 2) +h+ a) 
rQgiN — 1) +h+a)’ 
with x; defined by (9). 
The h-th moment of the generalized variance | a;;/(N — 1) | is obtained by 


6. The moments of the criterion for testing linear hypothesis in the linear and 
planar non-central cases. 


6.1. The moments of the criterion. There are several linear hypotheses con- 
cerning the means of multivariate normal populations that can be included in 
a general formulation of the problem. We shall first of all consider a simple 
case of a linear hypothesis and find the moments of the criterion under linear and 
planar alternatives. In Section 6.2 we shall indicate some linear hypotheses 
that can be reduced to this simple case. Regression problems and the problem 
of equality of means in several populations (studied by Wilks) are included. 

Suppose the variates zi2 (¢ = 1,2, ---,p;a = 1,2, ---,n) and yx (¢ = 1, 2, 


- p;v = 1, 2, ---, g) have the probability element 
le® ponte : p on Pe 
(40) 


exp | - 3 > 7 0" (Yiy — Bin) Yiy — ain) | TIT dzia ITI ev, . 


i,j=l y=1 1 y=1 
Let us consider the hypothesis Hy that the means of mp y’s are zero, namely, 


Ho: Hiy = 0 (= 1,2,°°°, p37 = 1, 2, +--+, m) 


Let 

(41) ai = = Yay Hix y 
(42) b; = 3 Sia Sia; 
(43) Cig = Giz + Di;. 


Then the likelihood ratio criterion for testing Ho , called by Hsu [8]the Wilks- 
Lawley hypothesis, is the 3(n + g) power of 


(44) W = | Bi; | 
| cs 
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Under the null hypothesis the b;; have a Wishart distribution with n degrees 
of freedom, and the a;; are distributed independently of b;; such that c;; has a 
Wishart distribution with n + m degrees of freedom. Wilks [5] has given the 
moments of W and in some special cases the distribution of W. 

We shall now obtain the moments of W for distributions specified by (40) 
where the rank of || wiz || (y = 1, 2, ---, m) is 2, ie., the planar case. Under 
this assumption the b;; have a Wishart distribution with n degrees of freedom, 
the a;; are independently distributed in such a way that the c;; have a non- 
central Wishart distribution with n + m degrees of freedom. Let «xj and x3 
be the non-zero roots of 


(45) a Mij Miy — Aoi; | = 0. 
7= 


It is clear that the distribution of W is unchanged if o’’ is set equal to 6;;.  Fur- 
thermore, we can take 7;; = «,5;; then the c,; are distributed according to 
Wei;, 5:5, «75:53 p,m, 2) with n + m degrees of freedom. The moments 
will be obtained by a method similar to that used by Wilks [5]. 

Let the expected value given by (39) be 


(46) E(\ca;|") = K(n + m, h, p, «), 


which is a constant depending on n + m, h, p, «i, and ka. If D(a;;) represents 
the distribution function of the a;; , one can write (46) as 


1 


(47) K(n + m, h, p, 3) = ~~~ 
gion gt? TT rain + 1 — 4) 
t=1 


Pp Pp 
| | c5; |" | bs; |X"? exp [— 3 Do bi] Dai) I] I] @b;;aA 
i=l j=i 


where dA is the volume element of the a;;, and where the integration is over 
the entire (permissable) ranges of the b;; and a;;. Equation (47) holds since 
the c’s are functions of the b’s and a’s. Multiplying (47) by 


(48) aT] r(s[n +1 — i), 


then replacing n by g + 2 and dividing by (48) again, we obtain 


Pp 
gie(nt2) TT rain + . i] + g) 
i=1 


(49) K(n + m + 2g, h, p, xi) . 
2" TIT rGin +1—i)) 
cn ee 
ba Pp 
gir PY TT r(a{n + 1 — 4) 
i=l 
p Pp Pp 
| | cs; |* | bis "72-? exp | - 1» vil D(a) IIT daa). 
i=] t=] t=] 


1 














THE NON-CENTRAL WISHART DISTRIBUTION 427 


By definition the right hand side of (49) is the expected value of | ¢;; |" | bi; |°. 
Hence 


' 2” |] rin + 1 — i] + 9) 
E(| Ci; | | b; i) = K(n +m + 2g, h, P, Ki) tT 


Pp 


II r(dfn + 1 — i) 


In this expression it is permissible to set h equal to —g (n could have been 
replaced by n + 2g in (47) to insure the argument of each IT function being 
positive). Then we have 


E(W’) = E(\eis|* | bis |") 


Pp 
2” TT rin + 1 — i) +49) 
Ke +a + ®, -6.28,.0 ———__..... 


II r(a[n + 1 — 4) 


Finally, the g-th moment is 

I[ra@int+m+1-)]][rGm+1-d+ 9) 
E(W’) = exp[—4(«i + «3))@ 2» ____________ 
IT rain + m+ 1 —J +9) Tramt+ 1 — i]) 


«8 1.B2=0 | 2°27 +81 482 a! 8! Bo! T(3[n + m — 1] + 9 + a@)P(Gln + m) + a) 


; r(a[n + m — 1] + a) | 
T(3[n + m] + g + 2a + Bi + Be) 
We can summarize in the following theorem: 

THEOREM 6. Let zia‘(t = 1, 2, ---, pz a = 1, 2, ---, n) and yi, (¢ = 1, 2, 
-**; p37 = 1, 2, ---, g) have (40) as a joint distribution. Define ai; , bi; and c;; 
by (41), (42), and (43), respectively. Let xi and xz be the non-zero roots of (45). 
Then the g-th moment of W, defined by (44), is (50). Expression (50) gives the 
moments of W inthe planar case. The linear case is a special case of the planar 
case, that is, it is the planar case for % = 0. The g-th moment of W in the 
linear case is given by 


Pp 


[irGint+-m+1-][r@h+1-ad4+9) 


E(W’) = exp [— ei] $2@ =" ___________ 
(51) [rain +m+1—-4 +9) [rain +1 4) 


0 


“>> (xi) r(3[n + m] + fi) 
=o 292 8,!T(3[n + m] + g + Bi)” 


For «x; = 0, (51) reduces to the expresion given for the moments under the 
null hypothesis. 
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Wilks [7] has given the distribution of W under the null hypothesis for 
several special cases (i.e., certain pairs of nm and p). In general, however, the 
distribution function is too complicated to write down explicitly. When the 
null hypothesis is not satisfied (i.e., at least one xj ¥ 0) the distribution functions 
are yet more involved. Hence, we shall not write any explicitly. 

Hsu [8] has given the asymptotic distribution of W. Suppose that 


p m 


v= Zz Do Mix bir 


i,j=l y=1 


tends to the limit WV» as n tends to infinity (if the y’s are functions of n). Then 
the limiting distribution of x = —(n + q) log W (which equals —2 log A, where 
A is the likelihood ratio criterion) is 


¥ “— < Wo x 
52 g-iem 4Vo jpm—1 —}2?2 ee ; 
- C8 6 Pall Gpm + a) 


That is, it is the x” distribution with pm degrees of freedom and parameter Vo. 
For most purposes, alternative hypotheses of the means being on a line 
(i.e., of rank one) are sufficiently general. In any particular case, one can 
compute from (51) mumerical values for several moments and then fit an appro- 
priate distribution function. If one wishes to consider alternative hypotheses 
of rank two, one can use (50) and similarly compute numerical values for mom- 
ents. The series in either (51) or (50) converge rapidly. To construct an 
approximate power function for linear alternatives, say, one would fit distribu- 
tion functions for several values of «xj and find the desired percentage levels. 
There is a matrix || d;; || such that 


| bas I] = lees |] + [| des |’ 
and 
Il ass |] = |] dss |] + || Xs6e5 |] + [dis {I 
where the 2’s are roots of 
(53) | ai; —_ bj; | = ( 
It follows that 
Less || = || dss |] - |] 2 + ADSe5 || + |] aes II’. 
Then W can be written as 
|di| - | di |’ 
|dis| - | (1 + Aas] > | dis |’ 
1 

= — . 

I (1 + A, 

oa 


W = 
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The distribution of the roots of (53) in the linear case has been given by Roy 
(9] for a;; 0 dimensionality p.” The distribution in the planar case has been 
indicated by Anderson [3]. One could obtain the probability of W not ex- 
ceeding a given value by integrating the )’s over the proper range. 

6.2. Examples of the general linear hypothesis. A number of hypotheses 
concerning the expected values of variates with multivariate normal distributions 
can be put into the form of Hy. The equivalence of the hypotheses is demon- 
strated by means of linear transformations. 

As an example consider the hypothesis H, that the means of several normal 
multivariate populations are equal when the respective covariance matrices 
areequal. Let xj. be the a-th (a = 1, 2, ---, N“) observation of the i-th (¢ = 
1, 2, ---, p) variate in the u-th (uw = 1, 2, ---, U) population. Let 


(55) E(Xia) = Bi (¢ = 1, 2, ---, p) 
(u = agay”* *s U), 
and let the covariance matrix be || ¢,; ||. Then the hypothesis is 
(56) Hi, Mi = os (2 
(u 
For testing this hypothesis let 
U NY 


(57) b= DO Lhe — #)(c}. — 23), 


u=l1 a= 


; 
(58) a = DN“ — BG; — 4), 


where 
(59) 


Ea 


N = 


u=l 


The b;; have n = N — U degrees of freedom and ¢,; = ai; + b:;, have N — 1 
degrees of freedom. Then the N/2 root of the likelihood ratio criterion for H; 
is W defined by (44). For this case equation (45) is 


2D N*wi — i)(uj — Bi) — roiz| = 0, 


ii + Nut. 


N sai 


5 Roy erroneously claims his distribution to hold for the planar case and higher rank. 
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Hsu has demonstrated that the general regression problem can be put into 
the form of Hy. Suppose that xi. (¢ = 1, 2, ---, p;} a = 1, 2, ---, N) follow 
a multivariate normal distribution with covariance matrix || ¢;; ||, and let the 
expected value of x: be 


E (aia) = Do Bir Wr (q < N ed DP), 


where the g by N matrix 
W = || Ure || 
is of rank g. Let H2 be the hypothesis that 
H2:Bi = ||Bull=O (@=1,2,--+,p;u=1,2,---,m <q) 
with the w’s known. Let 
Wi = || Wue || (u = 1,2,---,m;a=1,2,---,N) 
We = || Wre || 
(r = m+ 1, ---, 9; a = 1, 2, ---, N), 
X = || rel (¢=1,2,---,pja=1,2,---,N). 


|| bs; || = XX’ — XW'(WW’) "WX 
\| cx; || = XX’ — XW2(W.W2) WX’. 


(with || c:;|| = XX’ if W. = 0). Then the likelihood ratio criterion for H, 
is the N/2-th power of W, defined by (44). 
The equation (45) can be written in terms of Z, B; , and W as 


(60) |B,Wil — W2(W.W2)"W2)WiBi — r»AZ| = 0 
form <q. Ifm = q, (45) becomes 
(61) | B.WW’B, — r»z| = 0. 


In (60) and (61) there are no more non-zero roots than the rank of B,. It is 
clear that the roots of (60) (or (61)) depend on the matrix W as wellas B,. The 
distribution of A the likelihood ratio criterion under the null hypothesis does not 
depend on the distribution of the matrix W (if W isnot constant). However, the 
distribution when the null hypothesis is not satisfied does depend on x; or on xj 
and x; , and hence, on the distribution of the elements of W as well as the value 
of B, ‘ 

The special case of Hy for m = q = 1 gives as the likelihood ratio criterion 
as a function of Hotelling’s generalized 7”. From the moments indicated in (50) 
we can deduce the distribution of 7” when the null hypothesis is not true [3]. 
This result has been obtained by Hsu [10] by another method. 
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ON HOTELLING’S WEIGHING PROBLEM’ 


By ALEXANDER M. Moop 
Iowa State College 


1. Summary. The paper contains some solutions of the weighing problems 
proposed by Hotelling [1]. The experimental designs are applicable to a broad 
class of problems of measurement of similar objects. The chemical balance 
problem (in which objects may be placed in either of the two pans of the bal- 
ance) is almost completely solved by means of designs constructed from Hadamard 
matrices. Designs are provided both for a balance which has a bias and for 
one which has no bias. 

The spring balance problem (in which objects may be placed in only one pan) 
is completely solved when the balance is biased. For an unbiased spring 
balance, designs are given for small numbers of objects and weighing operations. 
Also the most efficient designs are found for the unbiased spring balance, but 
it is shown that in some cases these cannot be used unless the number of weigh- 
ings is as large as the binomial cooticiont( ha ) or ( 1 - 4 x where p is the num- 


2 


ber of objects. 

It is found that when p objects are weighed in N > p weighings, the variances 
of the estimates of the weights are of the order of o’/N in the chemical balance 
case (o° is the variance of a single weighing), and of the order of 40°/N in the 


spring balance case. 


2. Introduction. The problem is fully discussed by Hotelling [1] and refers 
to the design of a certain class of simple experiments. We may consider the 
typical example of the class to be that of weighing several small objects on a 
chemical balance or other weighing device. Hotelling and Yates [2] have shown 
that the individual weights may be determined more accurately by weighing 
the objects in combinations rather than weighing each one separately. The 
designs are applicable to a great variety of problems of measurement, not only 
of weights, but of lengths, voltages and resistances, concentrations of chemicals 
in solutions, in fact any measurements such that the measure of a combination 
is a known linear function of the separate measures with numerically equal 
coefficients. The designs should be particularly useful in biological and chemical 
laboratories engaged in routine chemical analyses. We shall, however, in the 
interest of simplicity, discuss the problem in the language of weighing operations. 

A particular design is denoted by a matrix. The three objects to be weighed 
in four weighing operations may be weighed by the following design: 


1 Journal Paper No. J-1405 of the Iowa Agricultural Experiment Station, Ames, Iowa. 
Project No. 890. 
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where the rows refer to weighing operations and thé columns refer to the objects. 
In the above design the first two objects are weighed together in the first weigh- 
ing operation, the first and third objects are weighed together in the second 
weighing operation, etc. From the four resulting weights the individual weights 
are estimated by the method of least squares. The design problem consists of 
finding matrices which will minimize the variances of these estimates. 

There are two distinct though closely related problems here. One is to find 
efficient designs for the case in which the measure of a combination can only 
be the sum of the individual measures: This would be the case, for example, 
in weighing objects with a spring balance and we shall refer to it as the spring 
balance problem. The other problem is to find designs when an individual 
measurement may be either added or subtracted in a combination. This would 
be the case in weighting objects with a chemical balance (since an object may 
be put in either pan of the balance) and will be called the chemical balance prob- 
lem. In the latter problem the design matrix may contain 0’s, 1’s, and —1’s, 
whereas in the spring balance problem the matrix may contain only 0’s and 1’s. 

We shall use Hotelling’s notation. There are p objects with weights bh, 
b,, «++, bp to be weighed in N > p weighing operations. The design matrix 
is denoted by 


(1) X = || rai || a = 1, ---, N;i = 1, «++, p. 
Denoting the transpose of X by X’, let 

(2) X’X = || aj; || = |j a’? ||7 

(3) gi — D> Lai Yo 


where ¥/q is the observed result of the a-th weighing operation. The least squares 
estimates of the b; are 


(4) . b= Dawg, 
2 

and the. variances of these estimates are 
(5) o=a'o 
where o° is the error variance of a single weighing operation. The a” will be 
called variance factors. “ 

Hotelling’s main theorem states that or any design, a‘ > 1/N, hence the 
best possible design is one such the inverse of the product of the design matrix 
by its transpose has its main diagonal elements equal to 1/N. We shall call 


such a design an optimum design. Examples show that optimum designs do 
not exist for all values of N and p. 
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When an optimum design does not exist, the question arises as to how a 
best design shall be defined. In the present paper a design will be called best 
if the determinant of the matrix || a'’ || is minimized. A best design in this 
sense is, therefore, a design which gives the smallest confidence region in the 
b: (i = 1,2, --°, p) space for the estimates of the weights. 

In certain situations, other definitions of best designs may conceivably be 
preferred. Thus, problems may arise in which one might prefer: 

(a) to minimize the variance factors subject to the restriction that they be 
equal, (b) to minimize some function of the variance factors, or (c) to minimize 
only a certain subset of the a‘ on a minor of the matrix || a’’ || as might be the 
case when one wanted only rough estimates of the weights of some of the objects, 
but accurate estimates of the others. 

When an optimum design exists, the confidence regions are not only minimized, 
but, as Hotelling has shown, the variance factors are also minimized. It is not 
true in general, however, that a best design as here defined (minimum confidence 
regions) will also minimize the variance factors. Examples illustrating this 
point are given in the last part of section 6 and the first part of section 7. 


3. Hadamard Matrices. The problem of finding the best designs is closely 
related to the Hadamard determinant problem. Hadamard [3] proved the fol- 
lowing result: If the elements x., of a square matrix X are restricted to the range 
—1 < xa3 < 1, the maximum possible value of the determinant of X is N?", 
and when this maximum is achieved all x23 = +1 and the matrix is orthogonal 
in the sense that X’X is a diagonal matrix; the non-zero elements of X’X are 


all equal to N. A matrix X which satisfies these conditions will be denoted by 
Hy. Obviously if Hy exists for a given N, it is the solution of the design prob- 
lem in the chemical balance case when N = p. 

With regard to the existence of Hy , it is known that a necessary condition is 


N = 0 (mod 4) 


with the exception of N = 2. It is not known however whether the above 
condition is sufficient, although it is known (Paley [4]) that H« exists for the 
range 

0 < 4k < 100 


with the possible exception of 4k = 92. Paley and Williamson [5] give methods 
of constructing Hy, in the given range (excepting 92) based on the theory of 
finite fields. : 


When N is a power of two, Hy is easily constructed by taking direct products of 


Thus 
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Sylvester [6] first studied this class of matrices and Kishen [7] has described 
weighing designs based on this subset of the Hy . 

The following examples of Hadamard matrices may be found in the literature: 
Paley [4] exhibits an Hg, Hw, and Hos: Kishen gives an Hig. From these 
examples Ha, and Hz may be constructed at once from the direct products 
H,-Hyw and H2-Hy. The following is an Hx : 


l++41 
l++4+11 
l+1++ 


+++4+4 

b+ i+ 
l++1++i+ 

L++++4+14 
b++i+ti 


+ | 


l+++4 
l+++1+ 
l\+++++ 
b+1+4++1+1 


r++ii++ 
b+t+u¢+ite¢ti1+¢i+i+ 


b+ 1+++4+4++41 
l++++4++4H1 


I+1++41 


| + 


i+ | 











where the signs represent +1. This example was constructed by Williamson’s 
method [5]. Thus examples of H4, for the range 4 < 4k < 32 are immediately 
available and methods of construction exist for the range 36 < 4k < 88. 


4. Chemical Balance Problem. When N = 0 (mod 4) an optimum design 
exists if Hy exists and is obtained by using any p columns of Hy. When 
N # 0 (mod 4) we may construct very efficient designs as follows: If N = 1 
we may add a row of ones to Hy_, ; if N = 2 we may add two rows of ones or 
a row of H.’s to Hy_2 ; and if N = 3 we may delete one row from Hy,. The 
worst of these designs will be obtained when two rows of ones are added to an 
Hy-_2, and in this case the variance factors are 
6) il 1 N+2p —4 1 


~N-2N+2-—-2 ~-N-2° 
Since it is known that these factors must be greater than 1/N for the best 


possible design in this case, the above design will be quite near the best design 
for large N. 


For small values of N we shall consider only the case N = p, since if one 
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wanted to make N > p weighings, he would normally choose N to be a multiple 
of four because the gain in efficiency by using optimum designs is rather large 
for small N. In general more than p weighings would be required because o” 
is not usually known. Thus several additional weighings may be made in 
order to obtain several degrees of freedom for estimating o’. 

When Hy does not exist we have already defined the best design as one which 
minimizes the confidence region for estimating the weights; that is equivalent 
to maximizing | a;;| or minimizing | a’ |. There may be several designs with 
the same minimum, but we shall not give all of them. Thus when p = 3 the 
best designs are 


|}+t + O| i+ +4] ie + 7 
A@it+ ~- +1, 1% ~— + hee 14+ — +7 
H- +4 t-+4h |--+4l 
all of which have A = 16 (which is considerably smaller than the value 27 
that A would have if an optimum design existed). Using the notation 


99 


(a’’) ad (a", a o*, a 


while the second and third give 
(a‘‘) — 


For N = p = 5, two best designs are 


both of which have 
A = 372° and (a") = (2/9, 2/9, 2/9, 2/9, 2/9) 
For N = p = 6, a best design is 


|| + 


+ i+ ws 


which has 


A = 52” and all a = 1/5. 
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For N = p = 7, a best design is 


+i titi 


which has 
A = 2"3‘ and all a = 1/6. 


These designs were constructed by a method due to Williamson [8] which 
will be described in sections 5 and 7. It is interesting to note that no minor 


of an Hg is a best design for N = p = 7, for any minor of an Hs gives 
A = 2" < 2%3' and all a“ = }. , 


5. Spring Balance Problem. N = p = 44+ 3. WhenN = pandN =3 
(mod 4) the best possible design for the spring balance case is determined by 
Hy. if it exists. Let Ky+;: denote a matrix formed from Hy; by adding or 
subtracting the elements of the first row of Hy: from the corresponding ele- 
ments of the other rows in such a way as to make the first element of each of 
the remaining rows zero. Obviously 


| Kwa | = + | Hw4r}. 


and excepting the first row, the elements of Ky+: are 0 and +2 with the signs 
of the non-zero elements the same for elements in the same row. Let Ly be 
the matrix obtained by omitting the first row and column of Ky41 , by changing 
all non-zero elements to +1, and by permitting two rows if necessary to make 
the determinant of Ly positive. Then 


| Hwia | = 2" | Ly | 


and it is clear that, given Ly , one could reverse the procedure and determine 
an Hy,;. In the same manner, there is a correspondence in general between 
square matrices with elements +1 and square matrices of one less order with 
elements 0 and 1. The ratio of the values of corresponding determinants is 
always 2” if their determinants do not vanish; hence the 0,1 determinant will 
have its maximum value when its corresponding +1 determinant has a maxi- 
mum value. Thus | Ly | is the maximum value possible for a determinant of 
0’s and 1’s of order N, and the value is 


(7) |Ly| = (N + 17/2". 
The variance factors are 


a" = 4N/(N + 1)’. 
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We knew in advance, of course, that the a” would be greater than 1/N since 
an optimum design cannot exist unless the design matrix has its elements equal 
to +1, and we must here restrict the design to have only 0 and 1 as its elements, 
Since Ly is a best possible design for the spring balance case, it follows that 
designs for the spring balance problem can be no more than about } as efficient 
as designs for the chemical balance problem. 


6. Spring Balance N > p. When N > p = the device used in the chemical 
balance case to get optimum designs cannot be used. For if we select p columns 
from an Ly we may get rows of zeros which would waste weighing operations, 
A different approach is necessary and a clue is given by the designs Ly. In 
these designs p is odd and the objects are weighed 3(p + 1) at a time in each 
weighing operation. We shall show in general that objects should be weighed 
3(p + 1) at a time when 7 is odd, and we shall obtain a corresponding result 
for p even. 

Let P, be a matrix whose rows are all the arrangements of r ones and p — r 
zeros (0 < r < p). (The symbol should also have a subscript p but that is 
omitted because any specific value for p will always be clear from the context.) 


The matrix will have p columns and : rows. Let Q be a matrix made up of 


matrices P, arranged in vertical order. Let n, be the number of times P, is 
used in constructing Q. Q is a weighing design for p objects and 


v-En(?) 


weighing operations. The matrix Q’Q will have diagonal elements 


(9) a= ¥»(?7}) 


r— 1 
and non-diagonal elements 
i“ p—2 
(10) b= In (?— 3). 
The determinant of Q’Q is 
A = (a—b)? "fa + (p — 1) 


and we may write A in the form 


A=c"'d 


c=a—b,andd =a + (p — 1)b. 
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We shall prove the following theorem: 


If p = 2k — 1 where k is a positive integer, and if N contains the factor (?), 
then A will be maximized when n, = N Ff (?) and all other n, = 0. 
~ We shall demonstrate this statement by showing that if any n, (s ¥ k) is 


decreased and n,, is increased in such a way that N remains unchanged, then A 
will be increased. Let n, be reduced by an amount m so chosen that 


m = m(2) /(2) 


is an integer; we may then increase n; by m’ leaving N unchanged. It is readily 
found that these changes in n, and n,; produce the following changes in ¢ and d: 


_ »(P\ & — s)\(k — s — 1) 


é Pp 


both of which are positive on zero when s < k and A is necessarily increased. 

When s > k, Ac is positive but Ad is negative and it must be shown that the 
net effect of these changes is to increase A, we shall assume now that n, = 0 
when r < k. 


AA = (c + Ac)” “(d+ Ad) — ec” 'd < [c” * + (p — 1c” *Ac|(d + Ad) 
—c? 'd <c” [cd + (p — 1)dAc + (p — 1)AcAd] 


where in the second line we have omitted terms in Ac of higher order than the 
first. These terms are all positive since all their fectors are positive. The 
bracket in the last expression on substituting from (9), (10), and (11), may 
be reduced to 


(Dle- 9 BaO= i) +@-+- 93) 


+m (7) «ths aho~a“~ »|, 


s 
and then to 


m(?\ Xn. (? - aG mi r(k — 8s) + (k +8+1 — 2r) 


& r>k , p-—l 


+” @c ee |. 


Each term of the sum in the bracket is greater than or equal to zero when k > 1, 
tr > k, s > k since the fraction is readily seen to be negative or zero under these 
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circumstances. The fraction vanishes only when k = 2, r = k, s = k + 1, 
The other term in the bracket is negative but it is dominated by the term in 
the sum for which r = s, as may be shown as follows: The two terms in ques- 
tion may be written 


p-—1 _ , 66 — 8) + 8-2 +1 
SS 
- oe? 
42 
p\e-1 8 
and since n, > m, this expression is less than or equal to 
2 2 

n(? ie - 9 | +s s+1, ik — ns | 

a~ i p—l ps 


which is positive for s > k since the bracket is negative as may be seen by 





1 
factoring out ——————— and putting the result in the form 
BON wp — 1s - . 


(k — s+ 1)(s8' + (p — 1K) — pk(p — 8) + (28 + 1)(k — 38). 


Thus AA has been shown to be positive and the theorem is proved. 

The above argument has shown that P, or repetitions of P; give more efficient 
designs than any other combination of the designs P,, P:, ---, P.. The ques- 
tion now arises as to whether these are the best possible designs. We shall 
show that they are by considering the matrices Ly of section 5 which are known 
to give the greatest efficiency in the spring balance case. Let p = 4¢ + 3 


and let N = ba A o and suppose L, exists (i.e. Hp,; exists). Using Py 
as the weighing design we find the a;; are 
aj; = 2N(t + 1)/p 
ai; = Nit + 1)/p 
A single application of the design L, gives 
ay = 2(¢ + 1) 
_aj=tri tj 


and N/p repetitions of L, gives an a;; matrix with elements equal to N/p 
times the given elements for one application of the design. The two designs 
are therefore equivalent and P», is a best design. 

The variance factors for repetitions of the design P;, are 


(12) fat ft N = 0 Mod (2) 


~ N (p — 1)? k 
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and these are minimum variance factors’ as may be shown by an argument 
entirely analogous to that used in proving the theorem. Thus P;, is a design 
which not only minimizes the confidence region for estimating the weights, 
but also minimizes the individual variance factors. 

Efficient sub-matrices of the P; have not been studied except for small p, 
but we may point out that square sub-matrices of order p which are as efficient 
as P;, do not exist unless H,_,; exists, for by the argument of section 4, it is pos- 
sible to construct H,+: from such sub-matrices. Hence we cannot obtain vari- 
ance factors as small as those given by equation (12) when N = p unless Hp4; 
exists. 

The situation here is analogous to that in the chemical balance case. By 
a proper selection of N we can obtain a design with the maximum possible ef- 
ficiency for any odd value of p. But here we are much more restricted in our 
choice of N. In the chemical balance case N could be any multiple of 4 for 
which an Hy existed; in the present case N must be a multiple of p even in the 
most favorable instance (p = 4¢ + 3), and for some values of p it may be neces- 

' MP tens : P 
sary that N be a multiple of Lp +1))° 

We now turn to the case in which p is even. The theorem corresponding 
to the one given at the beginning of this section is: 


If p = 2k where k is a positive integer, and if N contains the factor e . > 


then A will be maximimized when 


m= ma = N / (245) 


and all other n, = 0. 

We shall not prove this theorem in detail. By arguments analogous to those 
used earlier, it may be shown that A is increased when either n, (s < k) is de- 
creased and n, is increased, or n, (s > k + 1) is decreased and nx; is increased 
with N fixed. This done, we may put all n, = 0 except n;, and n,41 and then 
maximize A with respect to these two variables subject to the condition that 


Nk (?) + Ney le ; ) = N. 


The values of nz and nz; which maximize A may be found by treating them as 
continuous variables and using the calculus. 
The variance factors for these designs are 


«4 =p = a 


2 The author is indebted to a referee for suggesting this property of the design, and 
for several other valuable suggestions and corrections to the paper. 
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but these are not minimum variance factors. In fact one can obtain smaller 
variance factors than these by using only P, in the design (omitting P;,,, en- 
tirely). In this case 


(14) i pas) +" N =0 mod (?) 


and 


o- +1. ». 

p p+2 

We have not found explicitly the design which minimizes the variance factors 

for p even, but it appears that the design would be made up largely from P, 

with a small proportion of the design devoted to Pi4:. Thus (14) is very 
nearly the minimum possible variance factor. 


when p> 2. 


7. Spring Balance Designs for Small p. When p = 2, each object may be 
weighed r times by itself, and the two objects may be weighed together s times 
to give 


ass {| = "3° aa 


and if A is maximized subject to 2r + s = N we find 
r=s=WN/3 
a’ = 2/N 
provided N is a multiple of 3. The most efficient basic design is therefore 


|1 1 
X =||10 
01 


in accordance with the previous section. When N is not a multiple of 3 the 
best design is obtained by using the first row of X for the odd weighing when 
N = 3¢ + 1, and the last two rows when N = 3¢ + 2. 

The case p = 2 is notable in that there is almost nothing to be gained by 
weighing the objects in combination. For the variance factors 2/N would 
be obtained by simply weighing each object separately N/2 times. The ad- 
vantage of weighing in combination is only that square confidence regions in 
the b, , be space are replaced by ellipses with somewhat smaller area. If a = 
(r + s)/(r’ + 2rs) is minimized subject to 2r + s = N, we find 


r = N(3 — V/3)/3, a* = 1.866/N 


so that the a” are reduced slightly from 2/N but at the expense of increasing 
the area of the elliptical confidence regions. 
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For p = 3 the most efficient design when N = 3 is 


110 
X=)]101 
011 


as given by Lsor P;. Itis easily shown that for N > 3, the most efficient design 
is given by repeating X even when N # 0 (mod 3). Thus for N = 4 we would 
repeat one row of X, for N = 5 we would repeat two rows of X, and so forth. 
The variance factors are 


a N =3t 


9 
4N 
_ WW+)) 
~ 4(N — 1)(N + 2) 
9(N + 1) 
4(N — 2)(N + 1) 
For p = 4 we may attempt to find by trial and error a sub-matrix of the 
design given by using P, once and P3 once, but this would be a tedious process 
and the labor would soon become prohibitive for larger values of p. Hence 
another method must be found for obtaining the best designs when N = p 
except when L, exists. A method is provided by Williamson [8]. Let D, 
be the best design for N = p. Williamson shows that when p < 7, D,-, is 
a minor of D, , hence D, may be found by adding a row and column of variables 
to D,-, and expanding the determinant of the result by the bordered expansion. 
For small values of p it is easy to determine by inspection what values the 
variables should have in order to maximize the resulting expansion. William- 
son determined D, and D, by this method. 
There are two types of D, which give a maximum value of A = 9 


|1001 

1110 
and oo 14 . 
}0101|| 


The variance factors are all 7/9 for the first of these, and for the second 
(a") = (7/9, 7/9, 7/9, 4/9). 


When N = 5, p = 4, there are a number of designs which give a maximum 
A of 19. None of these however has all a" equal, and we shall give only one 
example: 


N=3t+1 


N = 3t+ 2. 
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(ai) = G 12 13 4 
~ \19? 19’ 19’ 197° 


which has 


When N = 6, there appears to be no design superior to P.. It has variance 
factors all equal to 5/12 and A = 48,—a very large gain in efficiency over 


N = 5 at the expense of one additional observation. 
When p = 5 there are three types of D; which give A a maximum value of 25, 
none of which has all variance factors equal. An example is 


Ds; = 0 


with 
a eane 
25’ 25’ 25’ 25’ 25 
For p = 6, an example of a Dg with all a" equal which maximizes A is 


111000) 
(100011 
10011 


0 | 
001101 
] 0 
] 


— 
illo 


1 
1 
0 
0 ] 


1 10) 
10101) 
with A = 81 and a" = 17/27. This example was constructed by the bordered 
expansion method from D; and it turns out to be a sub-matrix of P;. It is not 
as efficient as P; , however, since substitution of N = p = 6 in equation (14) 
gives a‘’ = 13/27. Hence we have shown that there does not exist a minor of 
P; (for p = 6) of order 6 which is as efficient as P3 itself. 
For p = 7, there is a most efficient design given by L7. 


with A = 2" and alla” = 7/16. 

D, for p = 8, 9, and 10 could presumably be constructed from L; in the same 
way and the designs for p = 4, 5, and 6 were constructed from L;, but the 
computations become very tedious for these larger values of p. 

The designs given in section 3 were constructed from the above designs by 
the method described in section 4. 


._3> = me 


a. | 


HOTELLING’S WEIGHING PROBLEMS 445 


8. Bias in Measuring Devices. In some kinds of experiments it may be 
necessary to estimate a bias in the measuring scale in order to estimate the meas- 
ures Of the objects. Such a bias may simply be regarded as an ,additional 
object to be measured except that it is an object which must be included in all 
the measuring operations. In the chemical balance case the bias presents no 
difficulty, for if an Hy exists, then there exists an Hy with a column whose ele- 
ments are all +1]. Such an Hy may be constructed from any given Hy by merely 
changing the signs of all elements in rows which begin with a minus sign. The 
result will be an Hy with +1’s in the first column and that column may be 
assigned to the bias. We note that the gain in efficiency by measuring objects 
in combinations is even greater in the case of a biased measuring scale than when 
there is no bias. For if the objects were measured separately, their measures 
would be estimated by the difference of two scale readings and would have vari- 
ance 20°; hence the variance factors a‘' are to be compared with 2 (rather than 
1) in the case of bias. 

In the spring balance case, the additional restriction that all the elements of 
one column be one necessarily reduces the efficiency of the designs in the sense 
that the variance factors for p objects and a bias will be larger than the variance 
factors for p + 1 objects without bias. When the measures of p objects and 
a bias are to be estimated from N = p + 1 measuring operations, a best design 
may be obtained by adding a row of zeros and a column of ones (in that order) 
to the best design for VN = p without bias. This can be seen by recalling that 
there are two determinantal exressions for the volume of a simplex with one vertex 
at the origin in a Euclidean p space. (A simplex (Sommerville, [9]) is a polytope 
with p + 1 vertices bounded by p + 1 (p — 1)-dimensional hyperplanes.) The 
determinant of the best design for N = p (without bias) is proportional to the 
volume of the largest simplex with one vertex at the origin and the other vertices 
restricted to be selected from the vertices of the unit cube. A determinant of 
order p + 1 with a column of ones and the other elements zero or one also gives 
the volume of a simplex with vertices selected from the vertices of the unit cube. 
Hence the two determinants (one of order p and one of order p + 1) must 
have the same maximum value, and as one of the vertices may be selected ar- 
bitrarily in the case of bias, we may select the origin. 

In general, for N > p, similar geometrical reasoning will show that the best 
designs for the spring balance problem in the case of bias are easily constructed 
from Hadamard matrices as described in the following theorem: 

lf X is a best design for the chemical balance problem in the case of bias and if X 
contains a row of +1’s, then a best design for the spring balance problem in the 
case of bias is given by replacing the —1’s in X by zeros. 

We have seen that the best design in the chemical balance case is obtained 
from a Hadamard matrix with a column of +1’s. Obviously the matrix may 
be also made to contain a row of +1’s by changing the signs of certain columns. 
The design X consists of the column of ones together with any other p columns. 
The determinant of X’X is 1/p!’ times the sum of squares of the volumes of 


a set of simplexes in a p space. There are ( ) of these simplexes deter- 


pt+il 
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mined by the different combinations of the rows of X taken p + 1 at a time, and 
the coordinates of their vertices are the last p elements of the rows of X. The 
vertices are therefore selected from the vertices of a cube in the p space which 
has its edges parallel to the coordinate axes, the origin at its center, and the 
lengths of its edges equal to two. Since X is a best design, the vertices are 
selected so as to maximize the sum of squares of the volumes of the simplexes. 
Now in the spring balance case we must maximize the sum of squares of the 
volumes of a set of simplexes which have their vertices selected from the vertices 
of the unit cube. Obviously this may be done by selecting vertices correspond- 
ing to the selection given by X. Thus it is necessary only to set up a cor- 
respondence beteen the vertices of the two cubes. Since X contains the vertex 
(1, 1, 1, 1, ---, 1) which is common to both cubes, the natural correspondence 
which identifies a vertex such as (1, —1, —1, 1, —1, 1, ---) with (1, 0, 0, 1, 
0, 1, ---) may be used. 

The variance factors for these spring balance designs are 4/N (for any p < N) 
when N is a multiple of four and Hy exists; when N is not a multiple of four 
and modifications of Hy as described in section 3 are used, the variance factors 
will differ from 4/N by terms of order 1/N’. 


9. Addendum. After this paper was written, the paper of Plackett and 
Burman on “The Design of Multifactorial Experiments” appeared in Biometrika. 
Volume 33 (1946), pages 305-325. A part of this paper discusses Hadamard 
matrices much more completely than we have done in section 3. In particular 
Plackett and Burman have constructed all Hadamard matrices of order less 
than or equal to 100 (excepting 92). 
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THE APPROXIMATE DISTRIBUTION OF STUDENT’S STATISTIC 
By Kar-Lar CHuNG 
University of Peking, Kunming, China 


Summary. It is well known that various statistics of a large sample (of 
size n) are approximately distributed according to the normal law. The asymp- 
totic expansion of the distribution of the statistic in a series of powers of n™ 
with a remainder term gives the accuracy of the approximation. H. Cramér 
[1] first obtained the asymptotic expansion of the mean, and recently P. L. Hsu 
[2] has obtained that of the variance of a sample. In the present paper we 
extend the Cramér-Hsu method to Student’s statistic. The theorem proved 
states essentially that if the population distribution is non-singular and if the 
existence of a sufficient number of moments is assumed, then an asymptotic 
expansion can be obtained with the appropriate remainder. The first four 
terms of the expansion are exhibited in formula (35). 


1. In a fundamental paper’ P. L. Hsu [2] has devised a method for obtaining 
the asymptotic expansion of the distribution of various statistics. The present 
paper deals with the so-called Student statistic. 

Let 


£1, fa, °°, &n 


be n independent random variables having the same probability distribution 
represented by a distribution function P(x). The rth moment and rth absolute 
moment are denoted by a, and 8, respectively. It is assumed that a; = 0 
and that for a certain k = 3, Bk < © and that a, > 0. Hence there is no loss 
of generality in assuming that a, = 1. 

Student’s statistic is defined as 


n ~% 
(é (é, a 4 ” 1 n 
E “= 2 where — = — )>é,. 


N r=1 


For brevity, we consider 


1 -} 
né (= (& — »') , 
Let its distribution function be denoted by F(z), i.e., 


Pr {ne be (& — p') < ‘S = F(z). 


1 The definitions of the various constants A, Ax, Q., Ar, 38, @, @,, are the same as 
in Hsu’s paper. 
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Discarding the case k = 3 where we can prove a more precise result and the 
singular case which can be shown to admit no asymptotic expansion in the sense 
of Cramér [1], we shall prove in this paper the following theorem: 

THEOREM. Let P(x) be non-singular and ax, < % for some integer k = 4, 
Then 


1 Ds 
(1) F(z) = &(z) + x(z) + R(z), &(z) = val e dy, 


where x(z) is a linear combination of the derivatives ®'(z), ---, &*"(z) with each 
coefficient of the form n™"(1 < v < k — 3) timesa quantity depending only on az4, 


-+ +, a_i Whose beginning terms are given in (35) and where 


k , 
(a ]+)) 
where Q, is a constant depending on k and P(x). 

We shall need some of Hsu’s lemmas, i.e., his lemma 3, lemma 7 (both for 
the particular case m = 2) and lemma 8. These we shall quote with this num- 
bering. The application of Hsu’s method to Student’s statistic depends on the 
following lemma. 


oof 
(2) |R(z) | S Ql + Jz“), a= — 


2. Lemma A. Foru 2 —1,1 2 1, we have 


21-1 r (3) ei (o- 1)’ r 3) | 
1+ p a. w—i|1lt+ ee ase reganeee uy! 
j=1 Tr ¢ ~ ir + 1) j=l r3 ” i) rj+) 





3 
—ee 21-1 r(3) 
<Vitusi+ 2 7 — | 
j=1 7 ; 
r(; 7 i) r(j + 1) 
Proor. By Taylor’s expansion of +/1 + u, we have 
21-1 r(3) 
Wits =1¢ Fo 
a r3 - i) rj+1) 
‘() 
i} age egitommen peer = 
r 3 ~ 21) r(2l + 1) 


whence it follows that (1 + #u)?*“'” is finite, and positive. The right- 
hand side inequality follows. 
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Similarly, if wu = 0, 


3 3 
ee : r(3) 
l+u2 i+ p> aeons EE saecinnn tes aE 
“rG-s)ro+n  rG-a)ra+y 
21-1 r (3 , a1 (—-1)T (3) 
>1+ genet ne Be aggre u® 
j=l ; , i= . . 
r(3- i)rG+ 1) 'r(3-5)rG+ 
since by a well-known result on the binomial theorem we have 
i+> =~/1—-1=0. 


“r3-s)roty 
For —1 S u < 0, we have 
21-1 r (3) 
1+ Pay 
rm r(3 - 3) rgj+) 


For —1 S u < 0, we have 


w-Vitu=>, say. 


(3) = 
21—1 r 9 
pait wa vig 
_ r3 - i) rj+) 
21-1 (-—1)'Tr (3) 
8 ce 


” r(3 — i) rgj+) 


Next, 
3 2 
21—1 r (3) : 
N = De Beg nm a — (1 + u) 

1 : , 
: ri - 3) r(jt+ 1) 
is a polynomial in u of the form 

U (ao + ayu + +++ + anyu’’) 


where a) > 0 and the successive coefficients have alternating signs; hence for 
—1<u<0,a)+ ayu+ --- + ae,u”’ assumes its maximum at u = —1. This 
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maximum is obtained by putting « = —1 in the numerator, hence for —1 < 
se < @, 


3 3 
21-1 (= 1)’ r 3) 
7" r(3 - i) rj +1) 
The left-hand side inequality in the lemma now follows. 


For brevity we write the inequalities as 


(3) 1+ Pe(u) = 1+ Poa(u) — byw S71 tu S 1+ Par(u), bx >O. 


3. We write 


dL (& — B= & — nF = n+ Vn(u — 1) X — ¥’, 
where 
X = > at Y = Vn. 
r=1 V/ na — 1)’ 


Then Student’s statistic may be written as 


ni(S@—B) =v(at4/4='x-F)- 
r==l , n n 


Then, for every z, we have 


Fe) = Pr{y (1 + yf =" x - ai < 


= Pr{/i4 Ey 524/14 4/23}. 
n = n 


For brevity let 





itty =v, 4/4=1x = 0. 


Suppose z < 0; then we have by (3), 


2+ 2Pyi(U) S$zVW1+ U S z+ 2Po,(U) 
Pr{V 32+ 2P2.(U)} S F(2) S$ Pr{V S 2+ 2zP2,(U)} 


(4) 


Suppose z > 0; then we have by Lemma A a similar inequality with the 
extreme terms exchanged. 


Now we take 1 = EI. and fix it henceforth. 
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Our next step is to obtain an asymptotic expansion for 


2-4 
Pr{V S 2+ 2P,(U)} = Pr{y < (1 +2) 


2\ 4 ——_ 
+2(1 +2) Pal g/S=! 1 x) 
n n 
with m = 21 — 1 or 21,1 = 1. 

Let b be any real number, and 


z - a, — 1 
sil+-—)} Pe —— X}) = L,,(X). 
n n 
Until section 12, we shall write simply L(x) for either of the L(x). 
4. Let W be the probability function of the distribution of the random point 


(X, Y) and let f(t , t) be the characteristic function. 
W(S) = Pr{(X, Y)eS} for every Borel set S in R, 


fit) = ff aw = \P (Fe va} 


p(t , &) = [ gitu(aa—D- Kat) tite gp 
Then . 
(6) Priysb+LMj= /[{ a= [few+[ [ c@y aw 
ySb+L(z) ysb 
where 
( 1 fb<ysb+ L(z), 
G(z, y) = <-1 ifb+ L(x) <ysb, 
0 otherwise 
We approximate G(x, y) by H(z, y), where 
[ — ifb<y <b+ L(z) 
H(z, y) =e ifb + Liz) <y <b 
| 0 otherwise 
We approximate dW by (w(x, y) + v(z, y)) dx dy, where 


1 wm oe 
wey) = aoe | [tot t) dade 
(zy )= oe [ [ e824 5 (t, to) With site) dts dte 
” (27)? lists Made ? ? 


, , (@ — Ve as 
— Tht? +2ptst2 +42) a. Ss ——— 
ma = e(Ta4) Vou —1 
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- — 


and y¥(7t;, tt) is given in Lemma 3 by taking therein ¢, = 


one it 
Wie ow i” fe =&,¢ 


being any of the é,’s. 
We write 


[ [ " Gley — u) dW 
- l ; l "(Gey — u) — Hey — w) aw 
a [ fo (G(x,y —u)— H (x,y — u)) (w(x,y) + ¥(x,y)) dy dx 
+ [ [ H(x,y — u) dW 
~~ [ [. H(x,y— u)(w(x,y) + y(2x,y)) dy dx 
- [. : G(x,y — u)(w(x,y) + y(a,y)) dy dx 


5. We have 


(7) |G(z,y — u) — H(z,y — u)| S1— 6" S ex! 


| [. [. (Gay — u) — H(x,y — u)) dW | < [ : et dW = cE(X") < Qe 





since 
“ 1 22 
E(X"') S$ A,E < Q, 
(X") sue (F—5) sa, 
where Q; depends on a3, ---, asx . 
Similarly, 


(9) | [ (G(a,y — u) — H(2x,y — u))(w(2,y) + v(x,y)) dy dz | < Qu 


Next, 
[ : j Fey — u)(w(z,y) + (x,y) dy dz 
10) = ff wen) +rewyayde — ff (xy) + rey) dy ae 
ySutb+L(z) anes 


where the first term on the right- mae side, regarded as a function of n~’, has 
a Taylor expansion in powers of n™*, whose first few terms we shall compute in 
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section 9; for the present let us denote it by B(u + b) + C(u+ b)n?*” where 
C = C(u + b) is a constant depending on k, P(x) and z, a more explicit estimate 
of which will be given in section 10. 

Further, we have 


(11) [ [ G(z,y — u) dW = / / dw -| [aw 


ysutb+L(z) ysu+b 


by Cramér’s asymptotic expansion for the mean +~/nY, and as is also shown 
in Hsu’s paper we have 


(12) [ Jaw = [J cola + vem) ay de =a 


ysu+b ysut+b 


Collecting all the results from (5)-(12), we get 


| | dW — B(u+b) — C(u+b)n?* 


ys ut+b+L (z) 


= Ap(e + n 2%) + [ | H(x,y — u) dW 


“a [ : [ . H (x,y — u)(w(2,y) +7(x,y)) dy dx 


Now we use A. C. Berry’s weighting factor = 


[sete ( I] dW — Blu +5) — Clu +6) du 


ySu+b+L(z) 


= AiT(e +n 7**) 


(13) ‘4 c 1 — con Te (f- [ Ses -~ oar 


U 


and obtain 





_ [ [ : A (x,y — u)(w(x,y) + y(x,y)) dy as) du 
since 
[ ae du = oP. 
6. To transform the triple integral on the right-hand side of (13) we use the 


Fourier transform as Hsu did. 
Let 


re [ l e ev (y, y) dy dx = h(t te) ; 


> 











454 KAI LAI CHUNG 


then 


[ [ e eau (yy anaes u) dy dx = e “h(t, ‘ A) 


[ [ e ttt itey ([ ae (x,y — u) au) dy dx 


u 
= {a0 = || Mit) Ital <7 
otherwise 
since 
"1 — cos Tu inn mT —|é#\|) if |a|<T 
I ——— a ; 
u 0, otherwise 


By Fourier inversion we have, almost everywhere, 


[ 1 —_ > Tu H(a, y— u) du — xf [. eftietttsy | to )h(ts, ts) dts dt; 


Hence 


[eer [ H(x, y — u) dW du 


(14) 0 T 
“ ~ [ [. (T — | te |)f(t, ta)h(tr, te) dtp dt, 


Similarly we obtain 


[ 1— x Tu ([- [ H(z, y — u)(w(z, y) + y(2, y)) dy 7) du 


as s [ [ (T — | te |)O(h, &) {1 + Wt, th) }A(tL, t) dee dty 


From (14) and (15) we obtain 
1a ces Pu ( f i] H(z, y — u) dW 


7 [ fo H(z, y — u)(w(x, y) + y(a, y)) dy dz) du 


(15) 


(16) a ah 
=o] [ w-iedue.y 


— (tr, te)(1 + Wit, , ite) }h(th, te) dt, dt, . 


7. To estimate the double integral on the right-hand side of (16) we break 
it up into parts and use the following estimates of h(t, &). 


Lemma B. We have forl = lf = t,. 
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2 
(1) | h(ts , )| S Aglz| D> (on — 1)¥ n-¥ Ot, 
7=1 
(2) [h(t, &)| S Qeti2’N( ||, n?, eM) 
where N( | t |, n~"””, €7) is a polynomial with constant coefficients in the indicated 
arguments. 
, Proor. 
|h(tr , &)| = [/ e tte itgy—ez?! dy dz — lf en itiz—itgy—ex?! dy de 
b<ysb+L(z) b+L(z)<y<b 
b+ L(z) b 
AS -4 Lyne 
L(2)20 “b L(z) <0 J b+L (2) 
i (—tt,)~ gq [ caer ae a 1) de. 
Hence 
hh, t)| S|ta(* | |eL(2) Je ae. 
: Since 
21 ’ 
| L(x) | < Ae lz| 0 (uw — 1)¥ n* | 2)? 
j=l 
we obtain 


| h(i, tr) | S Av|2| 2 (a aa 1)¥ n-¥ [ \2 hen in 


2l 
< A, | 2 | y ® (ou — 1)# no twat 
7=1 
Next, we write 
(ta, tt) = (— tye" [ w"(a)o(a) dx 
with 
” — ptt _ ,—ex2l, —ite L(z) 
u"(z) =e °™, v(x) =e ® (e — 1). 


Integrating by parts twice, we get 


hit, t) = (— ity [ ulado""a) de 
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whence 
|h(ts, ts) | < Apt? [ en) (2) | + ex | L(a) | + e | 2 (| La) | 


+ €x" | L(x) | + ||| La) | }de < Qh*(\z| + 2)N (lol, v4, €*) 


The lemma is proved. 
Now we write 


LL (T — |@|){f — 61 + Wh db dt, 


(17) fia ff « [[ =n+nt+h. 


lt1]SQxnd lt11>Qnn4 lt1|SQnnt 
li2|SQnnt ltalST Qent<|te|s7 


On J; we use Lemma 3 and Lemma B, (1): 
© © 1 
[Ti] = Q\2| [ | r be n —_ > 
co J—co j=l 
43 (| t; \* vce | t; ia B02) (ef +48) dts dt, 
i=l] 


2 
Saisite?* 2 «4? ie. 


j=l 
On J; we use Lemma 7 and Lemma B (2). Since | | > Q.n', | (1 + y) | < 
e-"**, and by Lemma 7, p(tin™*, tn) = e~** so that | fli, t) | < "**, |f — 


#(1 + y)| < &"% 


I, = Qy2” / / The" N(|t], 2 *, € ') dle dt. 


Iti] >Qunt 
Ite] ST 

Let « = n'*, 6 > 0, then it is evident that 

| I2| < Que’. 
Similarly using Lemma 7 and Lemma B, (1) on Js we see that 

lIs| S$ Q|z|. 
Therefore 

| 1 


i [. (T — |&\)i fi, &) — o(t, &)(1 + Wit, ih)) }h(h, &) dh di 





= 21 
< Q: ( 2 | 4 2 + lz | Tn 7k Zz en ; 
j=l 





' WN 
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8. Combining (13), (16), (17) we obtain 
if [J dW — Blu +b) ~ Clu + bn) du 


(19) ySUtb+L (2) 
21 

= (re + Tat” + lel +22 + |2| tnt? > n em, 
j=l 


Now we shall choose 7 and ¢ suitably. Let 
T=n", e=n*, a>O, B>O. 


To make the right-hand side of (19) a constant depending on z only, we must 
have a S 3(k — 2)-8 Sa. Then 


21 21 

ae ty 2 —l)j 2 
YY HG > CDs et, 
call j=1 


We must choose 8 < k/2, then 


21 

ve 7 
Yn (j+1)/21 < A,n® 2/2 
j=l 


To make the exponent as small as possible we choose 8 = a, then 
2l 
—i(k—-2 hj —(+1)/2 ~}(k—-2) + (2a— ol 
Le] Pak Do yh Hs Ay] | nt eat |g | pitta ae 
j=l 


since a is to be as large as possible, we choose 





a= a= min( £5? 2 ¢- *) l= H 
, 2° 24+)’ 2\° 
Then we obtain 
| ° 1 — cos me ( aa | 
ee dW — Blut b)— C(ut+ b)n du 
(20) b ¥ vs i (z) 
< Q.(1 + 2’). 


Let F*(u) be the distribution function of Y — L(X), and let 
F,(u) = B(u) + C(u)n**™ 
Then we may write (20) as 
I *1— cos Tu 


(21) > (Feu +b) — Fy(u + 0)} du 


< Q.(1 + 2’). 








By the definition of F,(u) we see that the conditions in Lemma 8 are all satisfied 
with a certain constant D depending on k, P(x), and z for the M therein. Then 
choosing b to be the a in Lemma 8, we obtain from Lemma 8 and (21), 


(22) pro{s [" 1— 288 ae — abs Qa +2) 
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where 
on i a 
56 = SD lu.b. | F*(u) — Fi(u) |. 
' Now there exists A such that if 75 > A, then 


Thy 
3 [ eee a se > 2, 
0 x 


2 
hence it follows from (22) that 
TS < max (A, D“Q,(1 + 2’). 
Thus for another Q; exceeding both A and the above Q; , we have 
TS < Q(1 + 2’) 
and so finally, dropping the prime, 


(23) | F*(u) — Fi(u) | S$ Q(1 + 2)DT” = Q.1 + 2)Dn™. 
In particular, taking b to be z(1 + nz) = 2’, say: 
(24) Pr {¥Y — L(X) S 2} = Ble’) + C(e’)n** + Al + 2)Dn™* 
where 
Biz’) + C(z’)n* = the Taylor expansion with a remainder of 
/ / (w(x, y) + (a, y)) dy dz, 
y—L(z) Sz’ 


and D is an upper bound for 
| B’(u) + C’(u)n** |. 


9. Let \ = n+, and rewrite the z’ + Lz:-:(x), 1 = 2 there as g(a): 


1/2 aya 
go) = 2 (14 SSD yy HEB tat 4 1) Nat + ++) 





16 
Then 
g() = 2’ 
— 1\3 
g'(0) an (a 5 1) 2'2 
g’(0) = -= ; lg 
_ 48/2 
g’”(0) ‘ani 3(a4 1) z'2°. 


8 
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Let p 2 0, ¢g 2 0; w,,(2, y) = w(x, y) where w(x, y) isdefined in section 


oPta 
ax” dy* 
4 and we know that 


@ 2? Peet y?)/20—p?) 


ee 
w(x, y) ~ QrvV/1 — p? 
Let . 
2 g() 
(26) fold) = [ [ Woq(x, y) dy dz. 
Then 


Fad) = [9 Oeopa(t, 9(0)) de 


or) S00) = | O’Odue(@, 9) + 9A) eus(t, 90))) de 


mn 


Fmd) = | (G'"O)mp(@, 9d) + 84/009" Ot o1x(2,900)) 


+ g(A)We.er2(x, g(A))) dx 
Let 


! a dt 
e= 00) = Fel a, a= eo 
(28) . 


[xq = [ X'Wyq(x,2') dx 


We have computed the following table of values of I’7,,: 


0 | & 0 0 0 0 
1 = po? +1) ad oo” 0 0 0 
2 oe” 4. po? +2) QV" +1) 90 0 
3 a 3pb% 7) po? +3) a 36 sa Spo +2) an Genet? a 60 0 


Next, we find, from (25)-(28), 
fo(0) = 4; 
Srq(0) = Tee for g 2 1; 
_. ase 
S(0) = SOV gr, 


(92) 
1 


272 
z' Tp.¢+1 








at —1 ’ a= 
frq(0) = — “- 2'T5g + 4 


_ 3 (a as py (cx si ye 


Sq (0) = 8 


3 873 
2'T nq 2” T pate 


3 o f 3/2 
fo ~ oT net + 
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Now we can expand 


[/ w(x,y) dy dx = [ CC w(x,y) dy dx = fod) 


ysb+L(z) 


Write the Taylor’s series for foo(A): 





fea) = foo(0) + foo(0)r + 220) y2 +O 4... 
Substituting from (29), we get 


ec g(A) tl 1/2 
[ [ w(x,y) dy dx = & — (a — 1)" p2’b” 


Qn1/2 
— 1 9 
(30) 4 5 {—2/(o + p®”) + 2(o™ + pe”)} 
(a — 1)" .., a) 3 (3) 2 (2) 3x (4) 
+ — 48n32 {32’(—3ph™" — p&") — 32" (—3p8" — pb”) 


+ 2"(—3pb® — p'b®)} + --- 


Further, we must obtain the beginning terms of +(x, y) as given in Lemma 3, 
for which purpose we refer to Hsu’s paper. We have, in fact 


2 ° _ iU® U, U3 1 Us os ot) 
¥(th st) = — Ga + ( 36)at\ioo ~ 72 + aeJant 


where 


2 


P| , 
Us= E(u G5 + te) 

















= at + 3-V uy — 16h +3= oar itt aie 8 
# 2 
won 8 SEH (efes +4) 
V cu 
—4 
= (a — 3) + 4 ‘Ve = fats >: 


— eas. 

= (as — 1005)f2 + ++ - 
To avoid the exhibition of very long expressions, let us separate the terms 
in (it , iz) according to the powers of n™””, and denote the terms of the power 


nn, n*” by tn, vo, vs , respectively. 
Thus y: = —iU;3/6n"”, and the corresponding y(z, y) is 





2 2—1 3 
tae) (n= +4) 
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1 innit 
(31) 1 (ty) = — 6ni?2 (cxeaz) +3 Vay — 1 wie(z,y) 


+3% —_ walz,y) +: - -) 


a4 





where, as hereafter, the terms omitted will yield nothing in the long run. 
Now we have by (31) and (26), 


co pg (A) 
[Lf new aa 


1 (asf + 3Va4 — 1fn(z, y) + :3 08 fa, y) +: 2) 





1 

~ 6nl! 
= a- 

= -on (caf) + 8Va4 — 1fu(0) + 35 —"* A + - --) 





a (ca s(0) + 3Va% — 1 fi2(0) + g&.... = f fu(0) + - 7) 


1 
— ape (on 


(32) = -< a i/2 (a3 162) = = as zZ "(a3 103 + 38Va 1 is) 


— 28 $50) + - .-) 














aa 





mt ~ 2 
+ oe tne + 3V eu — Tis + 3 — th) 





— 2" (aalie + 3Vaq— 1 + 3% “she. 


as a po” Eo e some a z' (a3 po + 3+/m — 18”) 


én? 


+ Sr | eat” + pe) + 6Vay — 1p +6 “oe o| 


as z” | eato'” +5 26%) + tVa.— 1 — 1 pe” +6 “oe a +... 


Similarly, omitting the intermediate steps to save space, we have 








x g(a) 
[ [. n@w ay dr = 2 (3a — 38 + 2050} 
(33) _ (a =)" z' <3(a, — 3)pe™ + 12 a 52 o® 
14473? , Vigo | 


+ 2a5 pb” + 1203+/a4 —iee} + +. 
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0 pg(A) 
[ [ y3(z, y) dy dx 
(34) 1 10 
_ 1 (a — 3 ~ (4) Of3 4 (6) (8) Cae 
an i *? ty? + fie )+ 


Combining (30), (32)-(34) and simplifying, we obtain, as the first four terms 
of the asymptotic expansion of F(z): 


[[ @@ + 1@, w) dy de = & — 2, (eo + 0) 





Se 7" 
y—L(z) Sz’ 
1 fa — 3.) , 2) 
+ i 6 - * 9 ” 
2 2 
1(% ow 1 2(% — 1) — a 20) uw — 1 " 
-}- 2” (a 5 ® 6» +2 # ao) 
(35) + iL __ % Seer 10a3 oe” 4 2 3 Oly o® + as 6® 
24n3/2 5 Ss 9 


+ 2! E — 303 = Daz a4 oe > 7003 (0%4 = 1 —_ as) —_ 2a5 @®) 


2 2 
4 a3(a3 — — 54 + 7) oe” _ as 2” | 
3s 3 


2 2 
4A E= — 1) o® — . «| uke 


10. In order to estimate the remainder C(z’)n™“~” in the Taylor expansion 
we write, in accordance with Lemma 3, 


@ pga) 
[ [. @@ a + r@, ») dy ae 


+ 2” |= as, + 3a3 — bas oP 4 a3(303 — Tox + 7) 2] 


eo g(a) k-3 
e jo [ {wt y) + 2, v2z(— _ Or 7qWry»_ (2, y) dy dz 


= fo) +L XE-D" On han®) = LIPO) 


4+ #4) —~ _ ae) “ mit LAA 1)"*"*a,,,, 


"TN te dt we | 
24 fan) 5 + fi" O™) Gay 


awe ete TH oft 
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3-4 k—3 


= Be’) + foo” ) gz yt 2 E(— 1 aan Serve COR) 


v1"3 


(k—2—»)! 
= Be’) + An” ( ~~ + : “2 @)), 
Thus . 
C(’) = (58 0 (A) + it @)) 
Now we may write : 


(k—2 


2) = [DAG @))woele, 9(0N) dx 


where, if we attach a weight s to g“(@d), the polynomial under the integral 
sign is isobaric of weight k — 2 — vin these g”’s, and the coefficient of each term 


is a constant multiple of a certain w,,(z, g(@A)). Further, it is easily seen by 
induction that we have 


g” (0) = Prsoe(z)(1 + @r727) 


where P42. (z) is a polynomial of the three variables z, x, 6 which is of at most 


the (1 + 2s)th degree in z and of the (21 — 1)st degree in x, and whose coef- 
ficients are all A; . 


Therefore, 
se @) | s [ Qle| tate + le 2) 
(LH | 2 [PPE gaat, g(Od)) ax 
s [ Qu(la| ta toe + le) + [2 PPO ew de 


< Q.(1 + |z|**) 
Thus 
(36) Cz’) S$ Q(1 + |z/*~) 
Lastly, an estimate of D is easy: 


co = putL(z) 
Pi) | =|2 [of cole, w) + ve, w) dy ae 
(37) 


s [ w(x, w+ L@) + |e, u + LG) |) dz SQ. 
Collecting the results of (24), (36), (37) we obtain 
Pr{Y — L(X) S 2’} = Bz’) + An((1 + Jz [**)n ** + 1 + 2) ™), 


ay = min{ => * ii Ls 
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Or, more simply, 
(39) | Pr{Y — L(X) S 2} — Bee’) | S Q@(1 + |z|*)n™, 
where the first four terms of B(z’) are given by (35). 


12. To return to F(z). We see that B(z’) depends on the function L(z), 
Recalling section 3 we now write B,, for the B corresponding to Ly, , with m = 
21 — 1 or 2. 

Then by (4) the value of F(z) lies between 


Pr{Y — Ly1i(X) S 2’} and Pr{Y — Lo,(X) S 2’}. 
From the asymptotic expansion just obtained for either of them, we see that 
the absolute value of their difference does not exceed 
| Bors(z’) — Bar(2’) | + Qe(1 + lz Pe. 
But 


Loj(x) = Lers(x) — 2’ber(o4 — 1)'n~“*x”! = Dg4(x) — bax” say, 


hence 


0 p2z’+Lo1—4(z) 
| Bua’) — Bue) < | f w(x, v) + v(x, 9) | dy de 


2/+L21—1(2)—b3,22! 
< Qebn << Q|z|n" < Q@|z|n™. 
Therefore 
| Pr{Y — Loi(X) S 2’} — Pr{Y — Lo(X) S2’} |S Qn” 

and so we obtain 
(40) F(z) = Biz’) + Al + |2/**)n™ 
which is equivalent to (2) in the theorem stated in section 1. 

Thus the theorem will be proved if the assertions regarding the form of f(z) 
in (1) are shown to be true. 

For this purpose we denote, as before, the terms of the order n”” in Wit, it) 


and y(x, y) by ¥, y, respectively. Since the term in y, which yields a wy, 
with the greatest g is U3, we have for every wy, in 7, the condition g S 3v. 


°o g(a) 
We expand | | v(x, y)dydz to k — 3 — v terms, in which f,,(0), fq(0), 


-+, f**™” (0) occur. In the integrand of f**”(0), e.g., the coefficients of 


each w,,(2, z) are polynomials in z and z of a total degree in z and x not exceeding 
that of (g/(0))**”, ie., 2(k — 3 — v). Hence the expansion of y, will give 
rise to terms of the form 


ZIi,, @g@s3y, st+t=22&k—3-— »). 
Such a term will yield a term 2’®°*”, which in turn yields the terms &” with 


= TH 
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rSstqtis 3v+ Ak —3 — ») S 3(k — 3), 


Equality holds only when vy = k — 3and q = 3(k — 3). But when vy = k — 3, 
the term in question is 


fosu—s(0) = Io. = 8 ™ 


Next, we see that y, contains U;, ---, U,i2. Since f“*” (0) is a poly- 
nomial of the (k — 3 — »v)th degree in x, the expansion of y, will yield I°,, 
---, 15°”. But 4°” = 0ifp>k—3—»v,heneep $k —3— >». Thus 
in y, we need only take account of the terms (7t,)”(7#)* with p < k — 3 — ». 
Now if j < k — 3 — »v, in U; only az, ---, agus) occur. Ifj7 =k —3 — », 
in the coefficient of a term (it:)’(it.)* with p < k — 3 — »v the greatest index 
of ais 


2k—-3—y»y+j—-—(kK-3 —vy) =jt+k-3-—vsk-1 


since 7 S v + 2. Hence in the expansion of every y only a3, ---, a_; occur. 
The proof of the theorem is completed. 
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SOME IMPROVEMENTS IN SETTING LIMITS FOR THE EXPECTED 
NUMBER OF OBSERVATIONS REQUIRED BY A SEQUENTIAL 
PROBABILITY RATIO TEST 


By ABRAHAM WALD 
Columbia University 


Summary. Upper and lower limits for the expected number n of observations 
required by a sequential probability ratio test have been derived in a previous 
publication [1]. The limits given there, however, are far apart and of little 
practical value when the expected value of a single term z in the cumulative 
sum computed at each stage of the sequential test is near zero. In this paper 
upper and lower limits for the expected value of n are derived which will, in 
general, be close to each other when the expected value of z is in the neighbor- 
hood of zero. These limits are expressed in terms of limits for the expected 
values of certain functions of the cumulative sum Z,, at the termination of the 
sequential test. 

In section 7 a general method is given for determining limits for the expected 
value of any function of Z,,. 


1. Introduction. Let x be a random variable and let f(x, @) be the elementary 
probability law of x involving an unknown parameter 6. Let Ho denote the 
hypothesis that @ = 6, and H, the hypothesis that 6 = 6,, where 4 and 4, 
are given specified values. The sequential probability ratio test for testing Ho 
against H, , as defined in [1], is given as follows: Put 


= i f(x: » 91) 
(1.1) zi = log Fas , 6) 


where x; denotes the 7-th observation on x. Two constants, a and b are chosen 
where a > Oandb <0. At each stage of the experiment, at the m-th trial for 
each positive integral value m, the cumulative sum 





(1.2) Zim | + oe + em 


is computed. Experimentation is continued as long as b < Z, <a. The first 
time that Z,, does not lie between b and a, experimentation is terminated. The 
hypothesis H; is accepted if Z,, = a, and Ho is accepted if Z», < b. 

Let n denote the smallest value of m for which Z,, does not lie between b and a. 
Then 7 is the number of observations required by the sequential test. The 
expected value of n is a function of the true parameter value @ and is denoted 
by E,(n). 

Upper and lower limits for E,(n) have been derived in section 4 of [1]. These 
limits, however, are of little practical value when the expected value of 
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_ f (z, 61) 
(1.3) z=lo flax, 0) 
is in the neighborhood of zero, for they converge to + © and — ©, respectively, 
as the expected value of z approaches zero. It can be shown that the expected 
value of z is negative when @ = @, and positive when @ = 6.’ Thus, if the 
expected value of z is a continuous function of 6, there will be a value 6’ between 
@ and 6, such that the expected value of z is zero when 6 = 6’. Hence, the 
limits for E,(n), as given in [1], are of no practical value when 6 is near 6’. 

The purpose of this paper is to derive upper and lower limits for E,(n) which ; 
will be, in general, close to each other when @ is in the neighborhood of 6’. Thus, 
it will generally be possible to obtain close limits for Z,(n) over the whole range 
of 6, if the limits given here are used for values in a certain small interval con- 
taining 6’, and the limits given in [1] are used when @ is outside this interval. 





2. Notation. We shall use the following notations throughout the paper. 
For any random variable u, the symbol E,(u) will denote the expected value of 
u when @ is the true value of the parameter. The conditional expected value of 
u, under the restriction that some relationship FR is fulfilled will be denoted 
by E,(u| R). The symbol P(R | 6) will denote the probability that the rela- 
tionship FR holds when @ is true. 

The cumulative distribution function of z will be denoted by F(z, 6) when @ 
is the true value of the parameter. The moment generating function of z, 
when @ is true, will be denoted by ¢(t, 6), i.e. 


0 


(2.1) g(t, 0) = [ e” dF(z, 6). 


3. Assumptions concerning the family of distribution functions F(z, @). In 
this section we shall formulate two assumptions concerning F(z, 6) which will 
then be used to prove various lemmas and theorems. Since we are interested 
in values of 6 near 6’, we shall restrict the domain of @ to a finite closed interval 
I containing 6’ in its interior. It will be understood throughout the paper that 
any statements concerning @ refer to the domain J, even if this is not explicitly 
stated. 

ASSUMPTION 1. The moment generating function g(t, 0) exists for any point 
t in the complex plane and any value 0, and is a continuous function of 8. 

ASSUMPTION 2. There eists a positive 5 such that P(e* > 1 + 6| 6) and P(e* <1 
— 5| 6) have positive lower bounds with respect to 0. 


4. Proof that y(t, @) is continuous in t and 6 jointly and that all moments of z 


are continuous functions of 6.” In this section we shall prove the following 
theorem: 


1 This follows easily from Lemma 1 in [1], p. 156. 


2 The original proof of the author was somewhat lengthy. The present proof was sug- 
gested by T. E. Harris. 











468 ABRAHAM WALD 


THEOREM 4.1. Jt follows from Assumption 1 that g(t, 0) 7s continuous in t and 
6 jointly and all moments of z are continuous functions of 86. 

Proor: First we show that g(t, @) is a bounded function of ¢ and @ in the 
domain |¢| S t&, for any finite positive value &. Clearly, 


(4.1) 0 < | oft, 6) | S 2p(h, 6) + o(—b, 8)] 


for all values ¢ for which |¢| S &. The boundedness of g(é , 6) and g(—t, 6) 
follows from Assumption 1. Hence ¢g(¢, @) is a bounded function of @ and ¢ 
over any bounded ¢-domain. 

Let {lm, On} (m = 1, 2, ---, ad inf.) be a sequence of pairs converging to 
the pair (t’, 6’). We have 


(4.2) (tm » Om) — g(t’, 8) = [eltm, Om) — (ly Om)] + [e(t’, Om) — g(t’, 6’)]. 
The second expression in brackets converges to zero by continuity in @. Thus 
the first part of Theorem 4.1 is proved if we show that 
(4.3) lim [g(tm 5 9m) — o(t’, Om)| = 0. 


mo 


It follows from Assumption 1 that for any given @, g(t, @) is an analytic func- 
tion with no singularities in any finite (-domain. Hence we can expand ¢(tm, 


6) in a Taylor series around ¢ = VU, i.e. 
= 1 (doll, On 

(4.4) g(tm 5 Am) co g(t’, Am) — 2. “s f of — ) ) it iad t')*. 
k= k! ot* t=<t’ 


Let r be a given positive value. Because of the boundedness of g(¢, @) in any 
finite t-domain, there exists a constant M such that | g(t, 6) | < M for all 6 
and for all ¢ in the domain ;¢ — t’| < r. From the Cauchy integral formula 
for an analytic function it follows that 


| a* v(t, Am) | —_M 


ot* tme’| 


. 1 
(4.5) kl 


From (4.4) and (4.5) we obtain 


(4.6) | etm, On) — oft’, On) | SM De 


Equation (4.3) is an immediate consequence of (4.6). This proves the first 
half of Theorem 4.1. . 

Let C be a circle in the complex ¢-plane with finite radius and center at the 
origin. According to the Cauchy integral formula we have 


k 
(4.7) 1 fob) y ~ 1H elt,®)| 
Qri Je tH ki at® emo 








1 ' 
= i! E,(z') ° 


Since g(t, @) is continuous in ¢ and @ jointly, the integral on the left hand side of 
(4.7) is a continuous function of 6. This proves the second half of Theorem 4.1. 


the 


tior 
(5. 
has 
zer 
106 


on 


pu 
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fu 
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5. Some lemmas. In this section we shall prove several lemmas which will 
then be used to derive the results contained in sections 6 and 8. 


Lemma 5.1. It follows from assumptions 1 and 2 that for any given @ the equa- 
tion in t 


(5.1) g(t, #) = 1 


has exactly two real roots, one of which is zero. The other real root is different from 
zero if Ey(z) ~ 0. If E,(z) = 0, both roots are equal to zero, 7.e., zero is a double 
root of (5.1). 


This lemma is essentially the same as Lemma 2 in [2] and the proof is therefore 
omitted.’ 

Let h(@) denote the non-zero root of (5.1), if Ey(z) = 0. If E,(z) = 0, we 
put h(@) = 0. 

In what follows the variable ¢ will be restricted to real values, unless the 
contrary is explicitly stated. 

Lemma 5.2. It follows from assumptions 1 and 2 that h(@) is a continuous 
function of @. 

Proor: It follows from assumption 2 that 


(5.2) lim g(t, é) = +« 
t-te 
uniformly in 6. Hence, since by definition 


g[h(@), 6] = 1 


identically in 6, h(@) must be a bounded function of 6. 
Let {06,.} be a sequence of parameter values which converges to 6*. From 
Theorem 4.1 it follows that 


(5.3) lim [o(t, Am) = o(t, 6*)| = 0 


mo 


uniformly in ¢ over any finite interval. Since h(@) is bounded, we obtain from 
(5.3) 


(5.4) lim {¢lh(6m), Om] — elh(@n), 6*]} = 0. 


Since g[h(@m), Om] = 1, it follows from (5.4) that 


lim g[h(O,), *] = 1. 


m-*o 


It follows from assumption 1 that for any limit point h of the bounded se- 
quence {h(@m)} (m = 1, 2, ---, ad inf.) we have 


3 Condition IV of Lemma 2 in [2] is not postulated here, since the validity of this con- 
dition is implied by assumption 1. Condition IV could have been omitted also in [2], 
since it follows from condition ITI. 
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(5.5) g(h, 6*) = 1 


If h(6*) = 0, then equation ¢(t, 6*) = 1 has the only root t = 0. Conse. 
quently, all limit points of {h(6,,.)} must be equal to zero, that is 
(5.6) lim h(@,) =O if h(6*) = 0. 

Now let us assume that h(6*) ~ 0. Since the second derivative of g(t, 6) 
with respect to ¢ is positive, it can be seen that y(t, 0) < 1 for values ¢ in the open 
interval (0, h(@)), and g(t, 6) > 1 for any ¢ outside the closed interval [0, h(6)]. 
Hence, g(t, 6) < 1 implies that | h(6)| > |¢| and h(@) and ¢ have the same 
sign. Now let &, be a value in the open interval (0, h(@*)). Then we have 


(5.7) g(t, *) <1 
It follows from assumption 1 that 
(5.8) e(to, Om) <1 
for sufficiently large m. Hence h(@,,) and & have the same sign and 
(5.9) | h(@m) | > | to| 


Inequality (5.9) implies that zero cannot be a limit point of the sequence 
{h(@m)}. Since y(t, 6*) = 1 has only the roots t = 0 and t = h(6*), it follows 
from (5.5) that the sequence {h(@,,.)} cannot have a limit point different from 
h(6*). Thus, 


(5.10) lim h(@m) = h(6*) 


and Lemma 5.2 is proved. 

Lemma 5.3. It follows from assumption 1 that for any given t, E,(e'*") is 
a bounded function of 6. 

Proor: We have 


(5.11) E,(e'') < E,(e* + &*) = oft, 0) + o(—t, 8) 


It follows from assumption 1 that ¢(t, 6) and ¢(—t, @) are bounded functions 
of 6. Hence Lemma 5.3 is proved. 

Lemma 5.4. Let 6’ be a value of 6 such that E,-(z) = 0, but E,(z) ¥ 0 for all 
6 + 6 in an open interval containing 6’. It follows from assumptions 1 and 2 
that 


e 2E,(z) ee 2 
(5.12) lim (-7H2)) = Ey-(z). 


Proor: We have 


(5.13) ht 14 hoe + oo e+ no of unos 


whereOQ <= uSi1. Hence 





Sit 


(5 
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(5.14) E,(e*) = 1 de h(0) E¢(z) 4 = E,(z’) + OT nity, 
Since E,(e**) = 1, we obtain from (5.14) 
(6.15) RO) Bale) + POT payer + VON Byer = 0. 


We shall consider only values @ for which h(@) ~ 0. For such values of 8, 
also E,(z) ~ 0. Dividing (5.15) by h(6)E,(z), we obtain 


h h 
(5.16) 1+ a | Bae + a Buoy = 0. 


Let & be an upper bound of | h(@) | with respect to 6. Then for a suitably 
chosen constant C we have 


(5.17) iw | < Ce! tol. 


From this and Lemma 5.3 it follows that E,(z*e*) is a bounded function 
of 0. 


Because of the continuity of h(@) we have 


(5.18) lim h(6) = 0. 
0-0" 


" Lemma 5.4 follows from (5.16), (5.18), the boundedness of E,(z*e“"*) and 
the fact that E,(z’) is a continuous function of @ and E,.(2’) > 0. 

Lemma 5.5. From assumptions 1 and 2 it follows that for any given t, E,(e'7*') 
exists and is a bounded function of 0. 

Proor: It is sufficient to show that E,(e'") is a bounded function of @ for 
any t, since 


(5.19) eltZal < gttn 4 ote 


bit+znt at+znt 


Clearly, e'”" lies between ¢ and e Hence Lemma 5.5 is proved if 
we show that E,(e*"‘) is a bounded function of 6. 

It follows from Assumption 2 that there exists a positive integer k and a 
positive constant g such that 


(5.20) P(lat-e:-+a|2a—b|@) 29 
for all 6. For any positive integer m and for any real values \1 < A: we have 


Pl(m — 1k < n S mk | 6] 


(5.21) Pm — Dk < n/6] 2g (m = 1, 2, -++, ad inf.) 
and 

Pl(m — 1)k <n S mk &X S en < ro| 4) 
(5.22) Pl(m — 1)k < n| 9] 


s1-[1— PQ. Sz <>r.|o}*. 
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Hence 


Pl(m — 1)k <n S mk &M S 2n < 2 | 4] 


(5.23) P{(m — 1)k < n S mk| 0] 





—1-[1— PQ. Sz <).|6 
= gq . 


Multiplying (5.23) by P[(m — 1)k < n S mk| 6] and summing with respect 
to m we obtain 


(5.24) P(\ S zn < 2/6) S 1—[1- PQ. S2< Ar 
g 
From (5.24) it follows readily that 
(5.25) P(\ S 2n < A2| 4) 


PQu Sz< A2| 8) 
is a bounded function of \;, \2 and 6. Let A be an upper bound of the ratio 
(5.25). Then 
(5.26) E,(e"**) S$ AE,(e") = Ag(t, 6). 


Because of Assumption 1, g(t, 6) is a bounded function of @. Hence also 
E,(e“") is bounded and Lemma 5.5 is proved. 


6. The limiting value of E,(n) when @ approaches a value 6’ for which 
E,-(z2) = 0. In this section we shall prove the following theorem: 

THEOREM 6.1. Let 6’ be a value of @ such that E,.(z) = 0, but E,(z) ¥ 0 for 
all 6 ¥ 6 in an open interval containing 6’. If assumptions 1 and 2 hold, we have 








: Eo(Zn?) - 
(6.1) lim | Bain = Ey (2) | = 
Proor: Consider the Taylor expansion 
2 3 
(6.2) ehO% = 1 + W(O)Zn + aT Zi, + ee Zhe 


where 0 <A <1. It was shown in [2] (p. 286) that 
(6.3) Ee * = 1. 


Hence, taking expected values on both sides of (6.2), we obtain 
2 3 
64) Ea Zy) + PO! pyar) + MON nyasero* = 0. 


We consider only values of @ for which E,(z) ~ 0. For such values, also 
h(@) + 0. Thus, we can divide both sides of (6.4) by h(@)EH,(z). We then 
obtain 
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= Es(Zn) , _h(6) 
E¢(z) 2E¢(z) 


it was shown in [1] (p. 142) that 








| Boz, +O) nyc quran | = 0. 








es E(Z,) 
(6.6) E,(n) = Exe) ° 
Hence 
» h(6) 2 h(@) 3 AA) Zn, | _ 
(6.7) E,(n) + Ez) E@ + 3 E,(Z,€ | = (. 


Let é be an upper bound of | h(6) |. Then for a properly chosen constant C 
we have 


(6.8) er | < Cel t02n! 


From this and Lemma 5.5 it follows that E,(Z°,e"") is a bounded function 
of 6. Since lim h(@) = 0 and E,(Z’,) has a positive lower bound, Theorem 
6-6’ 


6.1 follows from 6.7, Lemma 5.4 and Theorem 4.1. 

If lim E,Z%, = Ey-Z’, , Theorem 6.1 gives‘ 

6— 6’ 

Ey(Zn) 
Eo:(z?) ; 

Limits for E»-(n) can be obtained by computing limits for E,(Z%,). In the 
next section we shall give a general method for obtaining limits for E,[y(Z,)], 
where ¥(Z,,) is any function of Z, . 


(6.9) Ey:(n) = 





7. Determination of lower and upper limits for the expected value of any 
function of Z,. Let ¥(Z,) be a function of Z,. Limits for E,[y(Z,)] may be 
determined as follows: First we determine limits for E,[y(Z,) |Z, 2 aj. Letr 
be a positive variable. Clearly, for any given value r we have 


11) - Bal@Z.) |Zan a —redZ. 20 = Bie —+r+al|s27 
From (7.1) we obtain the limits 


g.l.b. Elva —r+z)|z227r) S Ely(Z,)|Z, = a] 

0<r<a— 

(7.2) < lub. Ely(a —7r+2z) lz 
b 


0<r<a— 


IV 


rl. 
Limits for E,[y(Z,) |Z, < 6] can be obtained in a similar way. Again, let 

r be a positive variable. For any value of r we have 

(7.3) ElW(Z.) |Z. S bandZ,1 = b+ rr] = Ely(b+r+2)|2 5 -r 

Hence we obtain the limits 


4 The validity of (6.9) was shown by the author [3] using an entirely different method. 
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gl.b. Ely(o +r + z)\z2 S$ —rl S Ely(Z,) |Z, S 0] 


0<r<a—b 


(7.4) < lub. Ely(b+r+2)\z < —7). 


0<r<a—b 
Since 


(7.5) Egly(Z,)] = P(Z, = a) Edly(Z,) | Z, = a] + P(Z, S b)Esy(Z,) |Z, < bl, 


a lower (upper) limit for E,[y(Z,)] can be obtained, by replacing the condi- 
tional expected values on the right hand side of (7.5) by their lower (upper) 
limits given in (7.2) and (7.4). 


8. Limits for E,(n) when h(@) is near but unequal to zero. Let 6’ be a value 
of @ for which h(6’) = 0. In this section we shall derive limits for E,(n) which 
will generally be close to each other for values @ in a small neighborhood of 6’. 

From equation (6.7) we obtain 


+ (n\ — (9) 2 , A(d) 3 MOZn | 
(8.1) Ey(n) = 2E (2) ) EZ + 3 E,(Z, € ) 
where 0 < \ < 1. Thus, limits for E,(n) can be obtained by deriving limits 


for EZ’, a E,(Z°,e""). Limits for E,Z?, can be obtained by using the 
method described in section 7. 

If @ is near 6’, any crude limits for E,(Z 5h 2n) will serve the purpose, since, 
as has been shown in section 6, Ey(Z°,e" ") is bounded and lim h(6) = 0. 


Limits for E,(Z°,e" 7") can be obtained as follows: For simplicity, let us 
assume that h(@) > 0. Then 


(8.2) fag *2gy™ (h(@) > 0) 


Thus, to determine limits for E,(Z5,e" ), it is sufficient to determine a lower 
limit for Z,(Z*,) and an upper limit for E,(Zie""). The latter limits may be 
derived by using the method given in section 7. 

If h(@) < 0, we have 


r73 3 Ah(O) Z r73 _h(8) Z, 
(8.3) Zn 2 Zne “air 


and a similar procedure will yield the desired limits for E,(Z5,e" ”"). 

It should be emphasized that the limits of E,(n), as given in this section, 
can be expected to be close only if h(@) is near zero. For values of @ for which 
h(@) is not near zero, the limits of Z,(n) given in [1] can be used. 
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THE EFFICIENCY OF THE MEAN MOVING RANGE 
By Paut G. Hore. 


University of California at Los Angeles 


Summary. In studying the variation of a variable subject to erratic trend 
effects, it is customary to employ as a measure of variation a statistic that 
eliminates most of such effects. It is shown in this paper that the statistic 
w= | tia. — 2: |W2/2(n — 1) is nearly as efficient as the statistic 
& = Dol” (eins — 2,)?/(n — 1) that is customarily employed. The asymptotic 
variance of w is obtained by integration techniques; the proof of the asymptotic 
normality of w is based upon a theorem of S. Bernstein on the asymptotic dis- 
tribution of sums of dependent variables. The method of proof is sufficiently 
general to prove the asymptotic normality of w, and of 6°, for x having a dis- 
tribution for which the third absolute moment exists. 


1. Introduction. Let 2, 2, ---, x, denote a random sample of size n from 
a population with a continuous distribution function f(x). If a measure of the 
variability of x is desired, it is customary to select the familiar statistic 


YS (a — 


(1) s = __ 


? 


or its positive square root s, as an estimate of the corresponding theoretical 
measure of variability. 

If, however, it is known that the variable z is subject to trend effects and that 
f(x) represents the distribution of x without such effects, then s’ will not serve 
as a satisfactory measure of variability about the trend. In order to eliminate 
the influence of trends, it is helpful to employ statistics that capitalize on the 
time order relationships of the observations. There are several statistics of 
this type available, although most of them make no pretense of completely 
eliminating trend effects, even if the trend is linear. 

Perhaps the best known among statistics of the desired type is the mean 
square successive difference, 


n—1 
9 
D (tins — 2)” 
i i=1 : 


(2) o og 


This measure of variation has been studied extensively in recent years. Among 
the results of these investigations is a determination [1] of the efficiency of 
5°/2 as an estimate of o” for a normally distributed variable when no trend exists. 
A closely related measure of variation that is not so well known is the mean 
moving range of successive pairs of observations, 
475 
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n—1 
; | Lena — XB | 
(3) Wt Tlerenemne « 
n—l 
Although w appears [1] to have been used by ballisticians, very little seems to 
be known concerning the relative merits of 5 and w. Since w is considerably 
easier to calculate than 4’, it would be preferred to & for applications in which 
computational advantages are important. However, one would hardly allow 
such advantages to dominate a choice unless 6° and w were about equally efficient 
as estimates of variation. 
The purpose of this paper is to determine the efficiency of w and to study 
efficiency properties of generalizations of w. 


2. Definition of efficiency. The definition that will be used in this paper 
[2] may be stated in the following manner. Let @ be a parameter, or a function 
of parameters, of the distribution function f(x). Let 7 be a statistic for which 
there exists a number yp such that 


t= ~ /n(T — 86) 
is asymptotically normally distributed with zero mean and variance yp. Let 
T’ be any other statistic for which there exists a number yw’ such that 
t’ = V/n(T’ — 8) 
° ° ° ° : “ . 2 one 
is asymptotically normally distributed with zero mean and variance vp”. Then 


T is said to be an efficient estimate of @ provided that » < wy’ for all possible 
choices of 7’, and the efficiency of any particular 7” is defined to be 


(4) Sp = (4). 
u 


In order to determine the efficiency of a statistic, it is therefore necessary 
to first demonstrate its asymptotic normal distribution and then calculate its 
asymptotic variance. This order of procedure will be reversed in the following 
determination of the efficiency of w. 


3. Variance of w. Let x be normally distributed with zero mean and unit 
variance. Then the mean of w, where w is given by (3), may be evaluated as 
follows: 


E(w) = E|22— x | 


1 ’ | —}(xr2+422 
5 | te — a | @ It? day dae 
T J—c 


: 2 2 ™ (x2/2 . (x2/2) 
—(r2/ =_ / —(2r2 
= e tf (x2 — xe i” dz, +/ (a1 — xe)e 71" ds, | dz 
— OO z2 


Qr Jw 





. nS 
ieee eeeeeneneeeemnenneneneeene 





LT 
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1 oc z2Q 3 
—(r2/2 -(r2/2 —(x2/2 (22/2 
= e = a | ei dx, — n | ei ) dx, + 2e (23/2) dx 
Qr — 3 —c Z2 


a "2 _(a9/2) 
—(12/2 2/2 -(x2/2 
<- e 3 2] 2 | e 71" du +e *2 ; die . 
T J-o 0 


If integration by parts is performed on the first integral with 
v2 ( a Bs 
% = [ ei day and dv = x2.¢ 72" dae, 
0 


the w term will vanish at both limits and E(w) will reduce to 


oo 


: el sd 
(5) E(w) = : i. dx» oY ot 


Tv 
This result could have been obtained more easily by other methods, but some 
of the integrals involved will be needed later. 

For the purpose of computing the second moment of w, it is convenient 
to separate the independent and dependent product terms of w’. Since there 
are 2(n — 2) of the latter, E(w’) may be expressed in the form 
(n — 1)°E(w’) (n—1)E|a—1 |’ + 2(n — 2)E | x2 — m1 || x3 — 2 | 


+ (n — 2)(n —- 3)E” | xe — 24 


I 


But 

E|\a. — 1|° = E(xe — 2)° = E(as) + E(zi) = 2. 
Consequently, because of (5), 
(n — 1)°E(w*) = 2(n — 2)E} a2 — % || 23 — t2| + 2(n — 1) 


+ 4(n — 2)(n — 3)/z. 


(6) 


Now consider the evaluation of the product term 


oo 
E an “" Za || 3 Xo); = (27) 3 [| | | te De || &3 — Xe 
— 


By means of the expressions that were used to give (5), this triple integral may 
be reduced in the following manner: 


co 
Fin—alln—al = @07 | | ty — a led? 
— co 
7 st iat 
2 2 [ e “i™ dx; + e€ oy | dx3 dx 
0 
* (22/2) ” 1w/ 2/2) | 
= (2n) a e.g m | e2 da + 2H | dare 

— 00 0 


00 ” rg ss 2 
4(2n)7 [ "hee a | ([ ore dz.) 


z2 
—(x2/2 —(x2/2 2 
+ Qroe~ 2! te et! da, + | dx. 
0 


—}(r2+a2+272) 
1 2 3 


¢ 





dx, dx2dx3. 


tor 
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These three integrals, without their constant factors, will be denoted by J, , 
Iz, and I;, respectively. J, may be evaluated by integrating by parts with 


zo 2 
(22/2) ~(22/2 
Uu = 2 ([ “i az.) and dv = x.e 72” dao. 
0 


The wy term will vanish at both limits; consequently 


oo z2 re 2 
to / oP Ez (23/2) I e-2) de ([ on (aHD dzs) Jan 
L— 20 0 0 
co z2 eo x2 2 
| X2 € = ca dx, dxo + | é —_— ([ é — az.) dx . 
Lo — 00 0 


The first of these two integrals may be evaluated in the same manner as the 
first integral preceding (5). The second integral may be evaluated by making 
the change of variable 
x2 
“= I et da. 
0 


As a result of such manipulations, 


i we ie 


(7) 





It will be observed that J. is the same as the first integral of (7) and that Is 
is available in tables; hence 


E | x2 — a1 || 23 — 22 | 


(8) 
= 4(2n)7} | MF + aV/ Qe 





4 V6q vel 
3 6 3 3 

If (8) is substituted in E(w’) will reduce to 
(92) E(w) = 2 =) ape | + ave | 8+ 


rg a(n — 1) 


1 42Vv3° 
T 


Since o2, = E(w’) — E’(w), (9) and (5) will yield the following desired variance 
of w, 


mm é£<-". | (3 -me=s yn + (eos ame :)|. 
(n — 1) T T 


4, Efficiency of w. Now let x be normally distributed with mean m and 
variance o. Then the mean of w as given by (5) will be multiplied by o and 
the variance of w as given by (10) will be multiplied by o’; consequently z = 
w \/7x/2 will serve as an unbiased estimate of c. In the next section it will 
be shown that 


= Vn(z — o) 
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possesses an asymptotic normal distribution. From (10) and section 2, it there- 
fore follows that the asymptotic variance, a’ that is needed to determine the 
efficiency of z is given by 


v=” 2(4 2V3 — 6 
4 T 


Now it is known that for x normally distributed s, as defined by (1), is an 
efficient estimate of « with uy” = 3; consequently, because of (4), the efficiency 
of z as an estimate of o is given by 


| 1 
i, «> = .605. 


2 a 
(11) 2 (= +V3-3 
In [1] it was shown that for x normally distributed 5°/2 was an unbiased 
estimate of o° and, assuming the normality of its asymptotic distribution, 
that the efficiency of 6/2 as an estimate of o was 2/3. Thus, 2 = wv/x/2 


possesses very nearly the same efficiency as a measure of variation of a normal 
: 2 
variable as 6/2 does. 


5. Asymptotic distribution of mean moving ranges. Although the efficiency 
obtained in the preceding section requires for its validity merely a demonstra- 
tion that for x normally distributed w possesses an asymptotic normal dis- 
tribution, it will be shown in this section that general mean moving ranges of 
a continuous variable x possess asymptotic normal distributions provided only 
that x possesses a third absolute moment. 

Let r; denote the range of the observations from 2; to xi+,1. Then the 
variable 


. 11+ fe tees + eet 
- =o n—-k+1 ~~ 
will represent a generalized mean moving range, of which w will be a special 
case when k = 2. 

A proof of the asymptotic property of W can be constructed as an applica- 
tion of a general theorem of 8. Bernstein [3]. Since his theorem is long and 
involves much explanation of notation, a simplified version of it that is sufficient 
to cover this application, and indeed many similar applications, will be given. 

Let yi, Y2, ***, Ym denote m variables for which the third absolute moments 
are bounded and let 


Sm = Yr + Y2 + oes + Ym. 


Then Bernstein’s theorem implies that if there exist constants ¢: , C2, cs, and cy 
such that 


(a) cm < os, < cam, 
and 
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(b) y, and y+, are independently distributed for 
g > cym, cy < 3, 
then 
Sm — E(Sm) 
Os 


m 


possesses an asymptotic normal distribution with zero mean and unit variance. 
Consider the application of this theorem to R = (n —k + 1)W. The vari- 

ance of R may be expressed in compact form by means of the techniques of 

section 3. Since r; is the range of k consecutive observations, it is clear that 


E(riti+g) = E*(r) 
if g > k. Furthermore, for subscripts for which it is defined, E(riri+,) will 
be independent of 7. These two properties may be used to collect terms in the 
expansion of E(R’) to give ° 
k-2 

E(R’) = (n - & + 1) E(ri) + 2 > (n —k-—- t) E(1ry 241) 

+ (n — 2k + 1)(n — 2k + 2)E(n)). 
Consequently, 


k—2 
(13) on = (n—k+ 1)E(ri) +2 p> (n — k — 1) E(ri 7244) 


+ [n(1 — 2k) + (& — 1)(8k — DIE (nr). 


From the definition of the correlation coefficient and the fact that a correlation 
coefficient cannot exceed one, it follows that 


E(riresi) S E(r1)E (124i) + or Or9 45 
< E*(n) + o7, . 
If this inequality is applied to (13), 
oe < (n—k+ 1E(ri) + (k — 1)(2n — 3k + 2)[E(r) + o7,] + [n(1 — 2k) 
+ (k — 13k — DIE(r) 

<(n —k + IEC? — F(r)] + (b — 1)(Qn — 38k + 2)0? 

< [n(2k — 1) — (k — 18k — Dor 

< 2koi(n — k + 1). 


Thus, for a fixed k the right inequality in (a) of Bernstein’s modified theorem 
is satisfied. 

For the purpose of demonstrating that the left inequality in (a) is also satis- 
fied, consider the following application of Schwarz’s inequality. Let 


—_ a Oo, oe wd 
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(14) G(tp, +++, %) = | wa [ ro Fee) +++ f(Terpa) ep. +++ ALerp-1, 


where f(x) denotes the distribution function of the variable x and the range of 
integration in this and subsequent integrals is from — © to ». Since r, and 
f are continuous non-negative functions, this integral is a positive function of 
the indicated variables. Then, denoting G(x,, ---, 2,) by G, it follows from 
Schwarz’s inequality that 


[= il — [ nse ++ f(y) dx -+- acs | 
(15) = il oe / {rif (x1) +++ f(x,)G}? {ri f(a) +++ f(a)G"}' day --- aa] 
< [-- [nse ++ flas)Gder, ++ dar | — | rif (a1)+ + -f(ax)G™ 


dx, +++ dxy,. 


The two integrals of this inequality will be denoted by 7, and I, , respectively. 
If the value of G given by (14) is substituted in J, , it will be observed that 


(16) a / ls / ee eT ee 


Now I, may be written in the form 


1, = | oe [a0 ‘ee f(x,)Gr il abe [nse ++ f(tp-1) day +++ dps 
dxtp-++ dry. 


Since the x; possess the same distribution function and 7; is the range of the 
variables from x; to 2; , the integral in brackets is equivalent to the integral 
defining G in (14); hence 


(17) Ip = [+++ | Hes) +++ SEG dey ++ dry = 1. 


If (16) and (17) are applied to inequality (15), they will yield the inequality 


| foe ffeen + fle ae bil az, | 


< [++ [rrp f le) +++ Sense adder -dteee 
In statistical language, this inequality states that 
E*(r,) < E(riy), 


or, what is equivalent, that 
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(18) E*(n) < E(rir)). 
If (18) is applied to (13), 
oe > (n— k + 1)E(ri) + (& — 1)(2n — 3k + 2)E(n) + [n(1 — 2k) 
+ (k — 1)(3k — 1)]E*(n) 


IV 


(n — k + 1)[E(ri) — E*(r1)] 
on(n — k + 1). 


Thus, for a fixed k the left inequality in (a) of the theorem is also satisfied, and 
it merely remains to be shown that condition (b) is satisfied. 

For k fixed, 7; and r;;, will be independently distributed provided that g > k. 
But if cz > k, then e;(n — k +1) > kforO < c4 < } becausen —k +1>1; 
consequently r; and r;+, will be independently distributed for g > c3(n — k + 1)", 
where 0 < c, < 3. Thus, conditions (a) and (b) are both satisfied by R. Since 
R= (n—k+ 1)W, it therefore follows that 


W — E(W) 


Ow 


IV 


(19) 


possesses an asymptotic normal distribution with zero mean and unit variance 
provided only that x possesses a continuous distribution function for which the 
third absolute moment exists. The existence of the third absolute moment for 
x insures the existence of the same moment for 7; . 

If k = 2, W reduces to w, and therefore the validity of (11) is assured. 


6. Other asymptotic distributions. The only property of the range employed 
in the proof of the preceding section was its positive nature; consequently the 
proof is applicable to moving means of other dependent statistics that are posi- 
tive and possess third absolute moments. 

For example, the preceding proof can be applied to 6’ to show that 6 possesses 
an asymptotic normal distribution provided only that the sixth moment of x 
exists. In the study [1] of the efficiency of 6° for x normally distributed, no proof 
was given of its asymptotic property. The preceding proof could be used in 
studying the efficiency of 6°, or obvious generalizations of it, as measure of 
variation for non-normal populations. The normality of the asymptotic dis- 
tribution of the serial correlation coefficient could also be verified by means 
of this proof. ; 
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CONFIDENCE LIMITS FOR THE FRACTION OF A NORMAL 
POPULATION WHICH LIES BETWEEN TWO GIVEN LIMITS’ 


By J. WoLrow!Tz 

Columbia University 
Summary. Let u and o be the unknown mean and variance, respectively, 
of a normally distributed population on which N independent observations 


%1,°**, ty have been made. Let Li and l:, Li; < Le, anda,0 < a < 1, be 
given constants. We define the following symbols: 


() y= (Vir) [exp }— 5 ay 
(b) ° ¢=N"2z; 
(c) s = (N — 1)72(a; — 2) 


(d) xi-« as that number for which P{x” < xj_2} = 1 — @ where x’ has N — 1 
degrees of freedom. 





— § 
(e) w= J/N-1 
X1—a 
-a/s (Le—z)/w 1 . 
° p= cay fo" welds 
(L1—2) /w 


It is proved that, under restrictions stated precisely below, and before the 
observations are made, the probability that D < y differs from a by a number 
which can be made arbitrarily small by making N sufficiently large. Thus an 
approximate (large sample) lower confidence limit for y is obtained. Similar 
methods can be applied to obtain upper and two-sided confidence limits. 

A problem raised by the present paper (but not attacked here) is to investi- 
gate the rapidity of approach to a of P{D < vy}. It would perhaps be useful 
to obtain a series for the latter in powers of N~*; the first term of such an ex- 
pansion is obtained here. 


1Formula (5.1) of the present paper was given without proof by the author in July, 
1945, in solution of a problem put to him by Dr. M. A. Girshick. At the time, both were 
members of the Statistical Research Group, formed in the Division of War Research of 
Columbia University under contract with the National Defense Research Committee of 
the Office of Scientific Research and Development. The validation of formula (5.1) in 
all rigor as it is given in the present paper was constructed by the author after he was no 
longer a member of the Statistical Research Group. 

In January, 1945, Professor A. Wald, then a consultant to the Statistical Research 
Group, and the present author jointly submitted to the Group an unpublished memorandum 
(#410) entitled ‘‘Acceptance Regions Which Involve the Normal Distribution and Large 
Sample Sizes.’”? While this memorandum dealt with a different problem, its ideas were 
logically antecedent to formula (5.1). The present author wishes to express his indebted- 
ness to this memorandum and to his coileague Professor Wald. 
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1. The problem. Let » and o be the unknown mean and variance, respec- 
tively, of a normally distributed population on which the N independent ob- 
servations 21, 2, ---, 2» have been made. Let L; and Le be given constants 
with I, < L2.. We then have that 


2 el. 
1= 771, 0 {- 3(4=*) fay 
is the fraction of the normal population which lies between ZL; and L.. The 
problem considered in this paper is to construct a lower confidence limit for the 
unknown 7, when N is large. An upper confidence limit or two-sided confidence 
limits may be constructed in a manner very similar to that described in the 
present paper. Since the construction of a lower limit is the problem which 
occurs most often in practice the discussion will be centered on it. 

A lower (confidence) limit on y with confidence coefficient a is a function 
D(a, -+-, tw) of the observations 21, ---, ty with the property that, before 
the observations are made, the probability is a that D(m, ---, tv) < y. In 
any specific application it is unknown whether this last inequality holds, because 
y is unknown. However, one who proceeds as if this inequality were true is 
using a procedure which will give correct results 100a% of the time in the long run. 

When either L; = —~ or Ll, = + the sialiaee by use of the non-central ¢ 
distribution, is well known. For a description of the procedure and necessary 
tables the reader is referred to [1]. 


2. Acceptance regions. Let yo be any value of the parameter y. To yo 
there correspond infinitely many couples (u, o) with the property that the 
normal distributions characterized by these couples all have a fraction yo lying 
between L; and L, ; we may write this symbolically by saying that the couples 
(u, «) satisfy 


(2.1) y(u, o) = Yo. 


The construction of confidence regions is equivalent to the construction, for 
every yo, of an acceptance region R(yo) in the N-dimensional Euclidean space, 
with the property that every normal distribution whose parameters uw and o 
satisfy (2.1) assigns to R(yo) the constant probability a. While this property 
of similarity (cf. [2]) is sufficient for the construction of confidence regions, 
additional properties of the acceptance regions R(yo) are needed in order that 
the confidence region be an interval or that the upper confidence limit be always 
one (i.e., that the confidence limits turn out to be a lower limit only), or to insure 
other features deemed desirable. 

It is easy to construct acceptance regions which will fulfill the condition of 
similarity. As an example, consider the case N = 3 for convenience. Let bi, 
be , bs be a number triple such that bi: + b. + b3 = 0. Let R(yo), for any given 
vo, 9 < yo < 1, consist of all the points 2 , x2 , x3 which are such that the absolute 
value of the angle y(—z < w < =) between the vector (b; , be , bs) and the vector 
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(11 — X,X%2 — Z, x3 — £) does not lie between wayo and t + mwa(yo — 1). (We 


N 
define, in general, » x; = N&. The points (a , x2, x3) for which 2; = x2 = 23 
may be disregarded, since their probability is zero when the distribution is con- 
tinuous.) One readily verifies that the probability of R(yo) for any yo is a, 
no matter what uw and o are, and hence this is true in particular for the pairs 
which satisfy (2.1). 

The above method of constructing acceptance regions yields confidence regions 
which, while they cover the unknown y with confidence coefficient a, are not 
very meaningful otherwise. The fact that the probability of R(yo) is a whether 
or not (u, o) satisfies (2.1) is already indicative of their lack of discrimination. 
Since % and s (where s is defined by . 


N 
ns = > (x; — 2) 
1 


and n = N — 1) are sufficient estimates of » and o, which in turn determine y, 
it is clear that desirable confidence regions should be functions only of # and s. 
Consequently our first task must be to construct the acceptance regions R(yo) 
in the Z, s plane. In the present paper we construct in the Z, s plane regions 
R(yo) which have the property that their probability, under any normal dis- 
tribution whose parameters satisfy (2.1), differs from the prescribed a by a quan- 
tity which is bounded in absolute value for all yo, in such a way that the bound 
approaches zero as N increases. Thus when the sample number is sizeable we 
can obtain confidence regions for y which correspond to a confidence coefficient 
which differs little from a. Finally, the acceptance regions R(yo) which we 
shall construct will be such that the confidence region will be always an interval, 


and the upper limit will always be 1, i.e., we will construct a lower confidence 
limit for y. 


3. Construction of regions R(yo) in the Zz, s plane. First we describe two 
assumptions which we shall make. It is believed that these are reasonable 
from the practical standpoint and are satisfied in most actual investigations 
where the present problem arises. Mathematically their purpose is to enable 
us to secure a uniform bound on the difference between a and the probability 
of R(yo) (for all yo) under all couples (u, ¢) which satisfy (2.1). 

ASSUMPTION 1: There exists a positive d such that 


Ijtd<yup<L-—d. 


In most practical cases where the present problem will occur y will be larger 
than 4. If the latter is the case and uw were very near either L; or Lz, then o 
would have to be very small. In that case other methods would have to be 
used in the solution of the practical problem. The present paper deals with 
the situation, unfortunately only too common in practice, where o is not too 
small. Assumption 1 puts a lower bound on o for any given value yo. (The 
bound is a function of yo). 
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AssuMPTION 2: The standard deviation o is less than a positive number C. 

In most practical problems such an upper bound can reasonably be set. 
Naturally, the larger d and the smaller C the more a priori information is at 
our disposal, the closer are our approximations and the narrower our limits. 
The effect of Assumptions 1 and 2 is to place a lower limit G on y where 


G = y(Li + d, C) = y(Le al d, C). 


Let yo be any positive number such that G < y <1. For an & such that 
I, < & < Ix, let r(%, yo) be the positive number such that 


¥(%, T(Z, Yo)) = Yo- 
We define xj_2 to be that number for which 
P(x’ < xi-a) = 1-2, 


where x” has n degrees of freedom and P is the probability of the relation in 
parentheses. The number xi-2 may be found in tables of the x’-distribution 
if the value of a is one of those in common use. Finally define 


2 
g(z, Yo) = r(z, vo) V = 
n 


The acceptance regions R(yo), G < yo < 1, which we shall employ, are defined 
as follows for any yo, G < yo <1: 


N<si<ch 


s = g(Z, Yo). 


4. Proof that P{R(yo)} ~ a. This section will be devoted to a proof of the 
following: 

THEeorEM. Let R(yo) be as defined in Section 3 forG <yo <1. Let the assump- 
tions 1 and 2 of Section 3 be fulfilled. Then the absolute value of the difference 
between a and the probability of R(yo) under any couple (u, «) which satisfies (2.1) 
as less than any arbitrarily small positive « when N is sufficiently large, 7.e., when 


N is sufficiently large, 
| P{R(yo)} —a| <e 
uniformly for all (u, o) which satisfy (2.1) withG@ < yo < 1, and which fulfill 


Assumptions 1 and 2. 
dr(Z, Yo) 


Lema 1. exists in the open interval lL; < & < In. 


Proor: We have 
Lea lool-$(te3y 
2 eo xX - = - d 
” VJ Qar(é, Yo) I. exp { 2 \r(%, Yo) , 
1 (Lo—z)/r 1 e 
— xp <— = dy. 
V Qa fa. exp 5 








_—_ 


Qs: Qs TH 
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Differentiating with respect to Z we obtain, since r > 0, 


an +R =) =e? (1 +T =") 
Ot Oz 





with 
R - Le — fF Ly = = 
r r 
Hence 
4.1) or Ger age 
(4. az Re-B2 — Tet) * 


Since R > 0 and T < O within the open interval L; < < < I, it follows that 
Cs . ‘ 
2g exists in the entire open interval. 

LemMMA 2. In the open interval Li < & < In, 

oP {s = ¢(z, vo) } 
Oz 

exists. 

Proor: We have, with k a suitable constant, 


P=k fo dy=k yee dy. 


J/aole (r(2,70)/0)x1—-a 


Hence 


oP a —kxi-a or T* Yi» ‘wine —?r xia 
om a= o = ( o a 20° ; 
aaa ie _ oP 

Lemma 3. Let 6 be any arbitrarily small positive number. The function | oa 
of = and yo is bounded forIy+6< €< Ll, -—5,G <yo <1. 
Proor: From (4.1) we have 
or per eer 1 es ( r r ) r 
es, oT fo. ee oo Tee 
| S eer Ge FO Om a me 

| aP 








Therefore from (4.2) we have that 








is less than a constant multiplied by 





n 5. 
(*) exp (= =) and is therefore bounded. 


o 20? 
PROOF OF THE THEOREM: From Lemma 3 and the Theorem of the Mean it 
follows that, in the closed interval 
d d 
~¢Cad om oe 
Ly + 2 aa ts L» 2 ’ 


the function P{s > ¢(%, yo)} is uniformly continuous in Z uniformly for all 
(u, «) which satisfy (2.1) with G@ < yo < 1. Hence for every positive ¢, there 
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exists a positive 7 < ¢ such that [li — | < 2, 


n+$<hbsh~§, 
implies 
|P{s > o(i, w)} — P{s = o(h, w)} | <4. 
For fixed arbitrary ¢, > 0 we have, when N is sufficiently large, 
Pij2#—e) <q] >1-<«¢, 
from Assumption 2 and the stochastic convergence of . Now 


P{s > y(u, Yo)} = a. 
Hence, when N is sufficiently large, 


| PiR()} ‘nd a | 4 é(1 = €2) +easat &. 


Since ¢€, and € are arbitrarily small, this proves the desired result. 


5. Construction of large sample confidence regions. The acceptance regions 
R(yo) whose size never differs from a by more than a uniform bound which 
approaches zero as N increases, readily yield a lower confidence limit for y 
(within the approximation involved). The confidence region consists of all the 
vo for which R(yo) contains the observed Z, s. Our acceptance regions R(y) 
areso constructed that, if 71 < ye , R(7v:) is entirely contained within R(y2). Hence 
the confidence region is an interval, one end of which is always unity, as was 
desired. The rule for constructing the lower confidence limit D is, therefore, 
as follows: 


a) ifi< Lorzt>llk.,thenD=G 
b) if lL, <  < Le, then 


1 (Lo—z)/w P 
(5.1) o« . I exp {—3y'} dy 


L,—z)/w 
where 
w=V/N-1--—. 
X1-a 

(The value of D may be found in a table of the normal distribution. It is easy 
to see that s = ¢(%, D), i.e., D is the smallest value of yo for which , s will still 
lie in R(yo)). 

If the statement D < y is made in a large number of cases, where the assump- 
tions are fulfilled and the sample size is large, the proportion of correct statements 
will be close to a. 
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NOTES 


This section 1s devoted to brief research and expository articles on methodology 
and other short items. 
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ON SEQUENTIAL BINOMIAL ESTIMATION 


By J. WoirowiTz 


Columbia University 


The present note, written after a reading of the very interesting paper by 
Girshick, Mosteller, and Savage [1], is for the purpose of adding a few remarks 
in the nature of a supplement. For the sake of brevity the notation and ter- 
minology of [1] are adopted in toto. 

Theorem 1 below generalizes Theorem 1 of [1]. In Theorem 2’ we formulate 
explicitly the fact which lies at the basis of the GSM method of estimation. Parts 
of the proofs of Theorems 3 and 4 of [1] are simply proofs of special cases of this 
(e.g., equation (2) of [1]). We then use this fact repeatedly in proving Theorem 
3, which states that the Girshick-Mosteller-Savage estimate is the only proper 
unbiased estimate for sequential tests defined by regions which we shall call 
doubly simple. 

A doubly simple region is defined precisely below. Intuitively we may de- 
scribe such a region as the one between two curves y = fi(x) and x = f(y), 
where fi(x) is defined and monotonically non-decreasing for all non-negative 
x, fo(y) is-defined and monotonically non-decreasing for all non-negative y, 
fi(0) > 0, f(0) > 0. If the two curves intersect, the region is finite, and the 
values of the functions f; and f2 beyond the point of intersection are of no inter- 
est. This description is of course purely heuristic, because in actual fact only 
integral values of the variables come into play, and intersection of the curves, 
for example, is not needed to make the region finite. Since the question of finite 
regions is completely settled by [1], Theorem 7, only non-finite regions remain 
to be discussed, and the precise definition given below is such as to imply that 
the region is not finite. It seems to the present writer that at least many of the 
non-finite sequential tests which may be developed for meaningful statistical 
problems will require doubly simple regions. The Wald sequential binomial 
test [2] defines such a region, which also falls within the scope of Theorem 6 of 
[1]. It is easy to see that there exist closed regions which are doubly simple 
and do not satisfy the conditions of this theorem. 

By a ‘‘proper” estimate p(a) we shall mean an estimate such that 0 < p(a) <1 
for every a. It is difficult to see how any estimate which is not proper can 
make much sense. 


A 
THEOREM 1. A sufficient condition that a region R be closed is that lim inf a 


ave VN 


< ©, where A(n) is the number of accessible points of index n. 
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Proor: The hypothesis of the theorem implies that there exist a positive 
number H and an increasing sequence of positive integers 1, N2, %3, -**, With 
the following properties: 

a) Nias > Qn; (@ = 1, 2, --- ad inf.) 

For n; sufficiently large, the conditional probability of reaching the accessible 
points on x + y = ni41, when an accessible point on x + y = n; has been 
reached, is < K < 1 by the normal approximation to the binomial distibution, 
where K is constant (and depends on H). Hence the probability of passing 
through accessible points on all members of the set x + y = n; (¢ = 1, 2, ---, L) 
approaches zero as L — ~, so that the region is closed. 

THEOREM 2. Let R be any region, B its boundary, and t = (a, b); any accessible 
point in R. Let 1,(a) be the number of paths from t to (a, y) = ae B. Let Q(t) 
be the conditional probability that a path, which has reached t, will reach the boun- 
dary B. Then 


p> L(a)p’ g” = Q)p' @’. 


THEOREM 2’. (Corollary to Theorem 2) 
If R is closed, then 


(1) X L.(a)p" gq = p’q’. 


Proor: Let k(t) be the number of paths in R from the origin to t. The 
probability of reaching ae B by a path which passes through ¢ is k(t)l:(a)p"¢q. 
The probability of reaching ¢ from the origin is k(¢)p’g*, and hence the prob- 
ability of reaching the boundary via t is Q(t)k(t)p’q’. From this the desired 
result follows. 

We now define a doubly simple region. The boundary of the region consists 
of the two infinite sequences of points 


(0, ao), (1, a1), (2, a2), «°° 
and 


(bo , 0), (bi, 1), (be, 2), °° 


where da), a1, @2, --- and bo, bi, be, --+ are two infinite non-decreasing se- 
quences of positive integers. The accessible points of the region are all points 
which can be reached by a path from the origin which does not contain a boun- 
dary point. (It is to be noted that since a boundary point is, by definition, 
a point not in the region which can be reached by a path in the region, the above 
definition implies that a doubly simple region is not finite. The reason for 
making this so has been given above.) 

THEOREM 3. Let R be a closed doubly simple region. Then f(a) is the unique 
proper unbiased estimate of p. 
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Proor: Suppose there were two proper unbiased estimates pi(a) and pe(q). 
Writing m(a) = pila) — po(a), we would have 


(2) y m(a)k(a)p"g? = 0 
with 
(3) | m(a)| <1 


First we prove 

LremMa 1. If a) > 1, then m(bb, 0) = 0. 

Proor: Let k*(a) denote the number of paths in RF from the point (0, 1) to 
the boundary point a. For all points a e B except (bo , 0) we have 





(4) bok*(a) > k(a). 
From (1), (2), (3), and (4) we have, since k(by , 0) = 1, 
| m(bo , 0) | g°? = | z m(a)k(a) pq? 
(5) a €B,ax¥%(bo,0) 
< D _ k(a)p’g? < bo do k*(a)p’g’ = bop. 
a¢éB,ax(bo,0) aeB 


Now as p — 0, the left member of the inequality (5) approaches | m(bo, 0) |, 
and the right member approaches zero. This proves Lemma 1. 

Lemma 2. For every z < do — 1, m(b,, z) = 0. 

Proor: In view of Lemma 1 it is sufficient to prove the following: 

If Z < a — 2, and if m(b,, z) = Oforz = 0,1, ---, Z — 1, then m(bz , Z) 
= 0. Let kz4:(a@) denote the number of paths in R from (0, Z + 1) to the 
boundary point a. For any point a eB whose ordinate is > Z + 1 we have 


(6) bobi aS bzkz+1(a) > k(a). 
From (1), (2), (8), and (6) we have 
(7) | m(bz , Z) | k(bz, Z)p’q’* = | Zm(a)k(a)p’g’ | < =Zk(a)p’¢? 


< bobi o7* bzrkz41(a) pg = bobi +s bzp?** 


where the summations take place over all boundary points whose ordinates are 
>Z+1. Hence 


| m(bz , Z) | k(bz , Z)q’? < bobs «++ bzp. 


and letting p — 0 we obtain the desired result. 
LemMa 3. m(ba,-1, @o — 1) = 0. 
Proor: Let s be the smallest integer such that (s, ao) is an accessible point. 
We proceed as in Lemma 2, with (s, a) playing the role of (0, Z + 1), and 
eventually obtain the following inequality: 


| m (Bay—1, 20 — 1) | k(Bay—1, 20 — 1)p%*qh*?-? = | Dio m(a)k(a)p” g* | 


s—1 
<p (= ian! + a bit’) 


(8) 
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where ~» denotes summation over all boundary points with ordinate > aq. 
The desired result follows. 

Lemma 4. Let h(> ao) be the smallest ordinate for which at least one boundary 
point (w*, h) exists such that m(w*, h) ¥ 0 (If no such h exists the theorem is proved), 
Of all such points let w be the one with the smallest abscissa. Then the point (w, h) 
as a member of the sequence 


(0, ao) (1, a1), (2, a2), -*° 


Proor: If the lemma is not true, then for all boundary points a with ordinate 
h, m(a) = 0, except that m(b,, h) + 0. Let W be that accessible point of R 
whose ordinate is h + 1 and whose abscissa v is a minimum. Let k, (a) be the 
number of paths in R from W to the boundary point a. For boundary points 
a accessible from W we have 


(9) bobi +++ bakw(a) > k(a). 
From (1), (2), (3), and (9) we have 
(10) | m(ba, h) | k(n, h)p"g’® = | Za(m(a)k(a)p’g" | < Eak(a)p"g” 


+ bob +++ bap" 'q’ = K*p'™, 


where: 

a) 2, denotes summation over all a e B for which y > h 

b) 2 denotes summation over all boundary points a of ordinate h + 1 and 
abscissa < v. 

c) K* denotes a constant. 

From this it easily follows that m(b, , h) = 0, in contradiction to the definition 
of h. This proves Lemma 4. 

Proor oF THEOREM 3: Let (w, h) be as defined in the statement of Lemma 4. 
From Lemma 4 it follows that, if any other boundary points with abscissa w 
exist, they must be members of the sequence (bo , 0), (b1, 1), (b2, 2), --- and 
hence their ordinates are < h. From the definition of (w, h) and from Lemma 4 
it follows that for any a e B whose abscissa is < w, m(a) = 0. 

Now in the proofs of Lemmas 1-4 the roles of x and y are not symmetrical. 
However, symmetry of course exists, and analogous lemmas follow. In par- 
ticular, the analogue to Lemma 4 has as a consequence that, since w is the 
smallest abscissa such that m(a) = 0 when abscissa of a < w, and m(w, h) ¥ 0, 
there exists a boundary point (w, h’), such that m(w, h’) ¥ 0 and (wu, h’) isa 
member of (bo, 0), (b1, 1), (b2, 2),... Then h’ < hk. But this contradicts 
the definition of h and proves the theorem. 

It is easy to see that, if the boundary points of a closed region constitute 
a single “curve” instead of two ‘‘curves’” as in a doubly simple region, the 
estimate p(a) will be the only proper unbiased estimate of p. 

It is interesting to consider some of the consequences of Theorem 3 for all 
unbiased estimates (not necessarily proper) for doubly simple regions. An 


\ 
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examination of the proof of Theorem 3 shows that it would go through with 
little change if equation (3) were replaced by the requirement that | m(a) | 
be bounded. We therefore obtain the following result: If for a doubly simple 
region there exists an unbiased estimate p(a) of p, not identically equal to f(a), 
then not only is p(a) not proper, but also, no matter how large M, there exists a 
boundary point a such that | p(a) | > M. The uselessness of such an estimate 
is manifest. 

The author is of the opinion that freedom from bias is not necessarily an in- 
dispensable characteristic of an optimum estimate. In general there is no 
reason for requiring the first moment of the estimate rather than any other 
moment to be the unknown parameter. The justification in any particular 
case must be based on special conditions of the problem. 

The author is indebted to Mr. Howard Levene for reading the present paper 
and making valuable suggestions. 
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DIFFERENTIATION UNDER THE EXPECTATION SIGN IN THE 
FUNDAMENTAL IDENTITY OF SEQUENTIAL ANALYSIS 


By ABRAHAM WALD 


Columbia University 


1. Introduction. Let {z.} (a = 1, 2, ---, ad inf.) be a sequence of random 
variables which are independently distributed with identical distributions. 
Let a be a positive, and b a negative constant. For each positive integral value 
m, let Zm denote the sum z; + --- + 2m. Denote by n the smallest integral 
value for which Z,, does not lie in the open interval (b, a). For any random 
variable u, let the symbol E(u) denote the expected value of u. The following 
identity, which plays a fundamental role in sequential analysis, has been proved 
in [1]. 


(1.1) Efe*™g(t)""] = 1, 
where 
(1.2) g(t) = E(e*') 


and the distribution of z is equal to the common distribution of z; , z2, ---, ete. 


Identity (1.1) holds for all points ¢ in the complex plane for which ¢(é) exists 
and |¢(t)| > 1. 
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The purpose of this paper is to formulate conditions under which we may 
differentiate (1.1) with respect to ¢ under the expectation sign. This is of 
interest, since various results in sequential analysis can easily be established 
by differentiating (1.1) under the expectation sign. For example, the formula 
for E(n) can immediately be obtained by differentiating (1.1) att = 0. The 
derivative of e*"y(t)~” at t = Ois given by 


(1.3) Zn — oN n = Z, — E(z)n 


where ¢’(¢) denotes the derivative of g(t). Hence, if we may differentiate (1.1) 
under the expectation sign, we obtain the basic formula 


(1.4) E(Z,) = E(z)E(n). 





If E(z) ¥ 0, the above equation has been used [2] to derive lower and upper 
limits for E(n). If, however, E(z) = 0, formula (1.4) is of little value. It will 
be shown in section 3 that 





(1.5) E(n) = =yn When E(z) = 0. 
This result is obtained, as will be seen in section 3, by differentiating identity 
(1.1) twice at ¢ = 0. 


2. A sufficient condition for the differentiability of (1.1) under the expectation 
sign. In what follows, the parameter ¢ in (1.1) will be restricted to real values, 
even if this is not stated explicitly. For any random variable u and any relation 
R, the symbol E(u | R) will denote the conditional expected value of u under 
the restriction that R holds. In this section we shall establish the following 
theorem. 

THEOREM 2.1. Jf g(t) exists for all real values t, identity (1.1) may be differen- 
tiated under the expectation sign any number of times with respect to t at any value 
t in the domain g(t) > 1. 

Proor: First we shall derive an upper bound for E(e’“"|n = m) for any 
given integral value m. Consider the case when ¢ > 0. Then 


t Zn 


(2.1) E(e’**|n = m) < E(e’**|Z, > a, n = m) (t > 0). 
Clearly, 


1 
(2.2) E(e***|Z, >a,n =m, e7""' = pe") = o'oB ( |e? > >. 
Let l(t) denote the least upper bound of the expression 


(2.3) pb (ett >) 


p 


— oo A 


AD 
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with respect to p over the interval (e°'!*!, 1). The existence of ¢(t) implie® 
that l(t) is finite. It follows from (2.1) and (2.2) that 


(2.4) E(e'"|n = m) < e*'U(2) (t > 0) 

and, therefore, also 

(2.5) E(e'") < Ut) (t > 0). 
If ¢ < 0, one can show in a similar way that 

(2.6) E(e'"|n = m) < e’‘U(t) (t < 0) 

and 

(2.7) E(e'*") < eU(t) (t < 0). 


To prove Theorem 2.1, it is sufficient to show that the following two proposi- 
tions hold.’ 


PROPOSITION 2.1. All derivatives of e”"'g(t)”" with respect to t exist in the 
domain g(t) > 1. 

PROPOSITION 2.2. For any positive integral value r and for any finite interval I 
in which g(t) > 1, it is possible to find a function D(Z,, , n) such that 


(2.8) Dyn) > Fle o"I| 


for all values t in I and 
(2.9) E(D(Z, ,n)| < 2. 


Proposition 2.1 is clearly true, if all derivatives of g(t) exist. The existence 
of these derivatives follows from the existence of y(t) for all values ¢. 


dt 
Since 77 e”"‘o(t)" is equal to the sum of a finite number of terms of the type 


Z'in"*e7"‘o(t)_", Proposition 2.2 is proved if we can show that for any given 
integral values 7; and rz there exists a function D,,,.(Zn , n) such that 





(2.10) Dyir(Zn, 2) > | Ziin"e7"“g(t)* | 

for all ¢ in J and 

(2.11) E[D,,7.(Zn, n)|] < @. 
Clearly, since g(t) > 1 in J, 

(2.12) | 273"e7*“o(t)* | < | Z2 | ne! 7% 


where & is an upper bound of |¢| in J. Let é be a value > &. Then for a 
properly chosen constant C we have 


(2.13) | 27 | el al" < Ce! Znlt, 


1See, for example, E. J. McShane, Integration, Princeton University Press (1944), p. 
216, 217 and 276. 
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Hence, it follows from (2.12) and (2.13) that 


(2.14) |Z n"e="'o(t)-* | < Cn el!" < Cn'(e7 4° nt) 
for all ¢ in 7. 
We put 
(2.15) Dy,r(Za, 2) = Cn"2(e7"t 4. 97 2atty, 
We have 


(2.16) E[D,,+(Zn ,n)] = C > Pmm"*[E(e2" | n = m) + E(e2"** |n = m)| 


where p,, denotes the probability that n = m. 
Hence, because of (2.4) and (2.6), we obtain 
(2.17) El[D,.(Zn, n)] < C(e"Uh) + e°"l(—&)][Zpnm™] = 
= Celt) + e°"l(—t)]E(n"). 


Since all moments of n are finite,” Proposition 2.2 is proved. This completes 
the proof of Theorem 2.1. 


3. The expected value of n when E(z) = 0. It will be shown in this section 
that 
E(Z) 
E(2?) 
if identity (1.1) can be differentiated twice under the expectation sign at ¢ = 0. 
The second derivative of e'”"y(t)~” with respect to ¢t is given by 


‘OP _ oe’ WOe® — We’ OF) zat py—n 
3.2 |Z — a2 0] - nee — let " o(t 
sa a(t wor f° 
where ¢’(t) denotes the first, and ¢’’(t) the second derivative of ¢(é). 
Since ¢(0) = 1, ¢’(0) = E(z) = Oandg’(0) = E(z’), putting t = 0, expression 
(3.2) becomes 
(3.3) Zn — nel'(0) = Z, — nE(z’) 


Hence, if (1.1) may be differentiated twice under the expectation sign at ¢ = 0, 
we obtain 

(3.4) E(Z’, — nE(z’)] = 0 

from which (3.1) follows. 

An approximate value of E(n) can be obtained from (3.1) by neglecting the 
excess of Z, over the boundaries. Then Z, can take only the values a and 
b. Hence 
(3.5) E(Z;,) ~ a@P(Z, > a) + UP(Z_ < b) 


where the sign ~ denotes approximate equality. 


(3.1) E(n) = 





when E(z) = 0, 





2See the paper by C. Stein, ‘‘A note on cumulative suns,”’ in this issue of the Annals 
of Mathematical Statistics. 
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It was shown in [1] (equation 28) that neglecting the excess of Z, over the 
boundaries, the approximation formula 
1 = e* 
(3.6) P(Z, > a) ~ aera 
holds, where h is the non-zero root of the equation g(t) = 1. This formula was 
derived there under the assumption that E(z) ~ 0. If E(z) approaches zero, 





h— 0 and the right hand member of (3.6) converges to 








a—b 
Putti i a eh a et ee ng b 
utting P(Z, > a) = 75 an (Z, < b) = a a 
tain from (3.5) 
—b a 
7 2 2 2 a 
(3.7) E(Z,) ~a Eager ab. 
Hence’ 
. —ab 
(3.8) E(n) E@)’ 


Limits for E(n) can be obtained by deriving limits for E(Z’%,). Let r be a 
non-negative real variable. One can verify that 


(3.9) a < E(Z,|Zn >a) < lub. El(a—rt+ z)’|z> 
0<r<a—b 
and 
(3.10) 06° < E(Z2,|Z, <b) < lub. Elb+r+2?\2+7r< O. 
O0<r<a—b 
We have 


(3.11) E(Z) = P(Z, > a)E(Z,|Z, > a) + P(Zn < b)E(Z2|Z, < b). 


Limits for E(Z*,) can be obtained by replacing the conditional expected 
values in the right hand member of (3.11) by their limits given in (3.9) and 
(3.10). 
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A NOTE ON CUMULATIVE SUMS 
By CHARLES STEIN 
Columbia Unwersity 


Let {Z:} be a denumerable sequence of identical independent real-valued 
random variables. Two constants a > 0 > bare chosen and the random variable 


n defined as the smallest integer for which one of the inequalities }> Z; > a 
1 


>> Z; < b holds. For any events E; and Ep, P{E,} will denote the prob- 
1 
ability of the event EZ, and P{E, | E,} the conditional probability of the event 
E, given that E, has occurred. 
It will be shown that there exists 4 > 0 such that the moment generating 
function, Ee”‘ exists for any complex number ¢ whose real part is less than or 


equal to é , and as an immediate consequence that n has finite moments of all 
orders. 


If d is any constant satisfying b < d < a, then, for fixed m, 


(1) Pie< bm tacabs Pda < e} 


) 


where c = |a| + |b!. We exclude the case P{Z, = 0} = 1. Then there 
exists « > 0 such that either 


i = P{Z;>e} >0O or & = P{Z, < —e} > 0. 
Taking, for example, the former alternative with m H + 1, 
€ 
(2) p{|3 2.) >cl> P{a>. for i= 1+ ym} = at >0 
1 ' 


where [w] denotes the largest integer less than or equal to w. For any poitive 
integer k, 


I 





P{n > km} eZ ; Bian 
ccc i” Pin > km|n > (k — 1)m} 


<P{o< dz <alb< DZ <a for s= 1, +(e = Dm} 


km, 
since n > km, impiies b < dX Zi <a. 
km, (k—1)m, 


But >> Z; » Zi t+ x Z; and the second sum on the right hand 
1 


(kT 41 
side is idenenitent of all terms in the first sum. 
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km, 8 
Thus the distribution of >> Z; given >> Z; for s = 1, ---, (k — 1)m de- 
1 1 
(k—1)m, 


pends only on >) Z, so that 
1 











P{n > km} f km, (k—1)m, (k—1)m, 
Sere <Pib< Z; i i 
P{n> (k — 1)m} — \ ani + x ~ sere X se 

(3) so. 
<P{ zs < eh <1 - oP by (1) and (2). 
(k—1)m,+1 
Consequently, by induction on k, 
(4) P{n>m} < Pin > H m: < (1 — amin) , 
1 
Let f) be any positive number less than — = log (1 — 67"). 
1 
Then 
Ee"*® sa > e"Pin wa m} 
m=1 
< ps e™ Pt (k — 1)m, <n < km} 
k=l 
(5) <Ded™"Pin > (k — 1m} 
k=1 


lA 


oO 
ps ermito(y foo sy 
k=1 


1 é ' 
= de te" — ar) }*. 


1 — 60 





But this is a geometric series with decreasing terms, and is consequently con- 
vergent. Thus for any ¢ whose real part R(t) < &, the moment generating 
function Ee” exists. Since, for all positive 1, m' < e”’* for sufficiently large 
m, n has finite moments of all orders. 
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1. A Test of Randomness in Two Dimensions. Howarp LrEveNgE, Columbia 
University. 


A square of side N is divided into V2 unit cells, and each cell takes on the characteristics 
A or B with probabilities p and gq = 1 — p respectively, independently of the other cells. 
A cell is an ‘‘upper left corner’’ if it is A and the cell above and cell to the left are not A, 
Let V; be the total number of upper left corners and let V2 , V3; , Vs be the number of simi- 
larly defined upper right, lower right, and lower left corners respectively. Let V = (V, + 
Ve+ V3 + V,)/4. It is proved that V is normally distributed in the limit with E(V) = 
pP(Nq + p)? and o?(V) ~ N2pq?(4 — 20p + 45p? — 27p%)/4. The conditional limit distribu- 
tion of V when pis estimated from the data, and the limit distribution of a related quadratic 
form are also obtained. These statistics are in a sense a generalization of the run statistics 
used for testing randomness in one dimension. 


2. Asymptotic Distribution of Moments from a System of Linear Stochastic 
Difference Equations. Herrman Rusin, Cowles Commission for Research 
in Economics. 


Let a B,y), + Tz) = u, , (6 = 1,2, ---), be a complete system of linear stochastic 
difference equations determining y;; (the coordinates of y,), > 0, in terms of yz: , t < 0, 
and z+, (the coordinates of z;), which are assumed to be fixed variates, and the random 
variables u,; (the coordinates of u,).. Such a system is called a stable if for every bounded 
set of fixed variates, and E(u/u,) uniformly bounded, E(y’y,) is uniformly bounded. This 
condition is shown to be equivalent to >> h:;, | finite, where y, = 7 H,(u}_,- ue 
- Oe is Ji,y/, is the solution of the above difference equation. Let Q, be an infinite 
quadratic form in y;_,,; and 2;_,,x (7, » = 0,1, ---) with coefficients depending only on i, k, 
7, and »v. Such a quadratic form is called convergent if the sum of the absolute values of 
the coefficients is finite. It is shown under fairly general conditions that the mean of a 


; 1 
convergent quadratic form is asymptotically normally distributed with variance o( + ; 


/ 


3. Conditional Expectation and Unbiased Sequential Estimation. Davin 
BLACKWELL, Howard University. 


It is shown that E[f(z2)Eay] = E(fy) whenever E(fy) is finite, and that o?(E.y) < o°(y), 
with equality holding only if Fay = y, where Ey denotes the conditional expectation of y 
with respect to the family of chance variables z.. These results imply that whenever 
there is a sufficient statistic wu and an unbiased estimate ¢, not a function of u only, fora 
parameter p, the function E,t, which is a function of u only, is an unbiased estimate for p 
with variance smaller than that of ¢. A sequential unbiased estimate for a parameter is 
obtained, such that when the sequential test terminates after i observations, the estimate 
is a function of a sufficient statistic for the parameter with respect to these observations. 
A special case of this estimate is that obtained by Girshick, Mosteller, and Savage (Annals 
of Math. Stat., Vol. XVII (1946), pp. 13-23) for the parameter of a binomial distribtion. 


4. A Discussion of the Ehrenfest Model. Preliminary report. Marx Kac, 
Cornell University. 


A particle moves along a straight line in steps A, the duration of each step being r. 
The probabilities that the particle at kA will move to the right or left are (1/2)(1 — k/R) 
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and (1/2)(1 + k/R) respectively. FR and k are integers and |k| < R. M.C. Wang and 
G. E. Uhlenbtck in their paper On the theory of Brownian motion II (Rev. Mod. Phys. Vol. 
17 (1945) , pp. 323-342) discuss this random walk problem and state several unsolved problems. 
In answer to some of the questions raised the following results are obtained: Let (1 — z)#-i 
(I+ 2)#ti = zc! v (j an integer) then, the probability P(n, m | s) that a particle starting 
from nA will come to mA after time ¢ = sr is equal to 2 °*(—1)"*"=(j/R)"Cysi Ce, , 
where the summation is extended over all 7 such that |j| < R. Also, if R is even the prob- 
ability P’(n, 0! s) that the paiticle starting from nA will come to 0 at ¢ = sr for the first 
time is calculated. For n = 0 this gives a solution of the so-called recurrence time problem 
first studied on simpler models by Smoluchowski. Through a limiting process in which 
7-0, A— 0, A?/27 — D, 1/Rr > B, nA > 2% , MA — 2, sr = t, one is led to fundamental 
distributions concerning the velocity of a free Brownian particle. In particular, P(n, m | s) 
approaches the well-known Ornstein-Ublenbeck distribution. 


5. Sampling from Contaminated Distributions. Preliminary report. JoHn W. 
Tukey, Princeton University. 


A contaminated distribution is a nearly normal distribution in which extreme observa™ 
tions are more frequent than in a normal distribution. By studying the bias and vari" 
ability of several measures of dispersion when applied to samples from particular one 
parameter families of contaminated distributions it is shown that: (i) for nearly norma 
distributions, the mean deviation is often better than the standard deviation; (ii) smal 
changes in the underlying distribution may increase the sampling variance of the standard 
deviation by a factor of three. This suggests that, in a broad class of cases, the mean devia- 
tion is safer than the standard deviation when a single dispersion is estimated from a set 
of data. This conclusion need not apply in an analysis of variance situation. 


(. On the Class of Functions Defined by the Difference Equation (« + 1)f(« + 1) 
= (a + bx) f(x). Leo Katz, Wayne University. 


The difference equation defines only three discrete functions: the binomial, the Poisson 
and the Pascal functions; the first and third have one parameter (JN) slightly generalized. 
It is shown that the Pascal function with this generalization is identical with the Polya- 
Eggenburgher distribution, which is a very useful form of the Compound Poisson Law and 
has been used to explain probability situations involving contagion. Areas for all func- 
tions in the class are given in terms of existing tables of the incomplete y and 6-functions. 
Observed distributions are fitted by two moments. As Carver (Handbook of Mathematical 
Statistics) pointed out, the advantages of fitting by difference equations are many; not the 
least is the fact that it is unnecessary to discriminate among the various functions in fitting 
an observed distribution. The problem of discrimination, posed by Frisch (Metron, Vol. 
10) and others, may be resolved in terms of the sampling distribution of variances for the 
Poisson function, since the three functions correspond to situations where the variance is 
less than, equal to, or greater than the mean, respectively. 


7. Retention of Decimal Places in Matrix Calculations. FRANKLIN KE. SATTER- 
THWAITE, Aetna Life Insurance Company. (Read by title) 


The accumulation of errors in matrix calculations has been studied by the author and 
others for special types of matrices and for special methods of calculation. In the present 
paper, error formuiae are developed for the standard Doolittle and Waugh-Dwyer Compact 
routines. These formulae do not place any restrictions on the matrices involved and do 
not require any extra calculations or initial approximations. Simple rules are developed 
which give for each step in the calculations the number of decimal places which must 
be retained. These rules are efficient in the sense that the retention of fewer places will, 
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except for good fortune in balancing of errors, lead to results less accurate than those 
specified. The rules also assist in choosing that arrangement of the calculations which 
will lead to the smallest average number of significant figures which must be retained 
for the calculation as a whole. 


8. The Efficiency of the Mean Moving Range. Pav. G. Hoe, University of 
California at Los Angeles. (Read by title) 


The statistic w = > — | a4, — ti | W2/2(n — 1) is studied as an estimate of o for a 
normal variable subject to trend effects. It is shown that the efficiency of w compares 
favorably with that of the mean square successive difference, 5?. The proof that w, and 
also 6?, is asymptotically normally distributed is made to depend upon a general result 
that can be derived from a theorem of S. Bernstein on dependent variables. 


9. Some Basic Theorems for Developing Tests of Fit for the Case of the Non- 
Parametic Probability Distribution Function. Braprorp F. Kurmsatt, 
New York State Department of Public Service. (Read by title) 


Given a universe with C.D.F. P[X < x] = F(x). Consider a random sample of n values 
x; which have been ordered so that xz; < x4: . The successive differences of the true c.d_f. 
values at X = xz; are denoted by u;. Thus 


wu = f(21) 
u; = F(2;) — F(ay-1), 2 <t<an 
Un+1 = ; —_ Pe). 
‘(THEOREM 1. The product power moments 
E(urusu; +++) 
for any or all different indices from 1 to n + 1, where the powers are real numbers greater than 
minus one, are given by 
— Tin+))MptUDrqt+)rw+)-:-: 
El(usu,u, +++) = 
rnt+l+p+qt+wet+-:::) 





Coro.tuary. If a range R(k, m) is defined by 
RO, m) = F(2m), R(n +1, m) = 1 — F(2n41-m) 
R(k, m) = F(Xi+m) — F (ze) 


where k and m are positive integers such that m < nand k + m < n, its probability distribu- 
tion is independent of k, and hence equal to that of F (rm). 
THEOREM 2. Given a test funtion of u; 


Y=) uj 
m 


where p is a real positive number, and the sum is for m indices chosen at random on the range 
lton+41. Let ¥ and o? denote the mean and variance of this test function. Establisha 
convention for increasing the indices included in the above sum for increasing m as n increases, 
such that [m/(n + 1)] = constant, to nearest multiple of 1/(n + 1). Then the asymptotic 
distribution of (Y — Y)/o for inreasing n, subject to the above condition, is the normal dis- 
tribution with zero mean and unit variance, except in the trivial casem =n+1,p = 1. 
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10. Confidence Limits for the Fraction of a Normal Population which Lies 


between Two Given Limits. Jacos Wo.rowirz, Columbia University. 
(Read by title) 


Let 2 , -++, ty be N independent observations from a normal population with mean yu 
and variance o”, both unknown. Let NZ = 22; and (N — 1)s? = (x; — #)? define Z and s?. 
Let L, and Lz be given constants with L; < Lz , and let 


= a y= pi 
y = (W2n0) exp -14} dy 
Li o 
By a lower confidence limit on y with confidence coefficient a is meant a function D(z, , 
-++,w) such that the probability isa that D<+y. Since and sare sufficient estimates of 
p and o? the restriction that D be a function of and s only is imposed. It is assumed that 
there exist a) a positive d such that L,; + d< yu < L2—d; b) a positiveC such that o < C. 
From these it follows that there exists a lower bound G = G(d, C) ony. Let xj~a be that. 
number for which P{x? < x{_2} = 1 — a, where x? has N — 1 degrees of freedom, and let 
V/N — 1s 
l-a , 


owt. <6 < hi, 











w= It is shown that if D be defined as follows: 


Le—z/w 
D = (2x) / exp {—}y*} dy 
L\—i/w 
2) D = G otherwise, then | P{D < y} — a| approaches zeroas N— ~. Thus Disa large 
sample lower confidence limit. The extension to upper and two-sided limits presents 
no difficulty. 


11. The Consolidated Doolittle Technique. Paun BoscHan, Econometric 
Institute. (Read by title) 


The quadratic matrix notation is interpreted as a segment in a sequence of matrices 
wherein each successor matrix is augmented by a bordering row and column. Extension 
theorems based on this idea date back into the last century. The step from the original 
concept to one of higher order is also fruitful in discussing inverse matrices, specifically 
the inverse of a symmetric matrix. The symmetry of the matrix of normal equations for 
a set of multiple regression coefficients is restored by adding the transpose of the column 
on the right side of the equations, i.e. the co-variances with the dependent variable and 
the variance of the dependent variable itself. The inverse of this matrix can be con- 
structed as partial sum over a series of matrices. Each individual element of this series 
is in itself meaningful. The solution for the set of multiple regression coefficients relating 
the k-th variable to the preceding (k — 1) variables is a column matrix. The product of 
this matrix with its transpose expressed in terms of the residual variance forms the k-th 
term in the matrix series. The summation of the first n products yields the inverse matrix. 
This characteristic of the inverse can be used to great advantage in the standardization 
of elementary computational steps. 


12. Estimation of Structural Equations through Linear Transformation of 
Regression Coefficients. THroporE W. ANDERSON and HERMAN RvBIN, 
Cowles Commission for Research in Economics. 

A method is presented for estimating the coefficients of a single structural equation in 


a system By) + Tz) = u) (t = 1,2, ---, 7), where B and F are matrices of coefficients, y; 
is a row vector of G observed jointly dependent variables, z; of K observed predetermined 
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variables and u; of G random elements. Given the distribution of the random elements, 
the equations define the distribution of the y;. Some coordinates of z, may be coordinates 
of y:-1 , etc. It is assumed that the structural equation to be estimated has at least G — ] 
coefficients prescribed zero. The part of the population regression matrix corresponding to 
the predetermined variables with zero coefficients has rank one less than the number of 
jointly dependent variables with non-zero coefficients. The maximum likelihood estimate 
of this matrix is a linear transformation of the unrestricted sample regression matrix. The 
estimated vector of coefficients of y; is the vector annihilated by this matrix. The vector of 
coefficients of z; is estimated by means of this vector and the regression matrix. These 
estimates are consistent and asymptotically normally distributed. For z, fixed, small 
sample confidence regions are given for the coefficients. 


tl a OO 


NEWS AND NOTICES 


Readers are invited to submit to the Secretary of the Institute news items of interest 


Personal Items 


Dr. Armen A. Alchian, who has been discharged from the Army with the rank 
of Captain, is now an Assistant Professor in the Economics Dept. at the Uni- 
versity of California at Los Angeles. 

Dr. Franz L. Alt is now Assistant Director of Research at the Econometric 
Institute. 

Colonel Dinsmore Alter is on terminal leave after more than four years’ 
service in the Transportation Corps of the Army, and has returned to his duties 
as Director of the Griffith Observatory in Los Angeles. During these years 
Colonel Alter traveled approximately 250,000 miles on the ocean as a Trans- 
port Commander, visiting each continent except the Antarctic. 

Dr. Theodore W. Anderson, formerly with the Cowles Commission, is now an 
Instructor in the Dept. of Math. Statistics at Columbia University, and plans 
to be on a Guggenheim Fellowship beginning in June 1947. 

Mr. Herbert Barkan has been appointed to an Instructorship in the Newark 
College of Engineering. 

Mr. Robert E. Bechhofer, formerly a statistician with The Kellex Corpora- 
tion, is a graduate student at Columbia University this year. 

Mr. Stanley G. Behrends is now Cost Accountant with the California Wire 
Cloth Corporation, in Oakland. 

Messrs. Carl A. Bennett, Jack I. Northam, and Max A. Woodbury have 
all returned from various types of war service to the University of Michigan 
as graduate students in statistics. Mr. Bennett was with the Manhattan En- 
gineering District for over two years, first at the Metallurgical Lab., University 
of Chicago, and then at Oak Ridge, Tenn. Mr. Northam was recently dis- 
charged from the Army with the rank of Lieutenant, having served with the 
Signal Corps for four years in the Pacific area. Mr. Woodbury was discharged 
from the Army with the rank of Captain, having been in the Meteorology serv- 
ice for five years, most of which time was spent in the European theater. 

Mr. Richard Berger has received his discharge from the Navy, and is employed 
as a Research Analyst with Dun and Bradstreet. 

Dr. Archie Blake, formerly at Aberdeen Proving Ground, is now Senior 
Statistician in the Office of the Army Surgeon General. 

Dr. Ernest E. Blanche, who had been teaching in one of the European Army 
University Centers, is now Principal Administrative Analyst in the Plans and 
Policy Office of the War Department General Staff, and is also Lecturer at 
American University. 

Mr. Royal F. Bloom has resigned the position which he held for a short time 
with the Psychology Dept. of Iowa State College after his release from the 
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Navy last March, and has returned te the Navy Department as Assistant Head 
of the Classification Research Division, Bureau of Naval Personnel. 

Mr. Earl K. Bowen has been appointed to an Instructorship in statistics at 
Babson Institute of Business Administration. 

Mr. Albert H. Bowker is enrolled this year as a graduate student at the Uni- 
versity of North Carolina. 

Mr. Charles R. Brearty has joined the Technical Staff of Bell Telephone 
Laboratories, Inc. 

Mr. Clyde A. Bridger is on leave from his position at the University of Utah, 
and is spending the year at the Institute of Statistics in Raleigh, North Carolina. 

Mr. Arthur W. Brown, formerly with the Columbia University Division of 
War Research, is now with the Standard Oil Company of New Jersey. 

Dr. George W. Brown, formerly connected with the RCA Laboratories at 
Princeton as Research Engineer, has accepted a position as Research Associate 
Professor in the Statistical Laboratory at Iowa State College. 

Mr. Richard H. Brown has been appointed to a Lectureship in Mathematics 
at Columbia University. 

Dr. Richard 8S. Burington, Director of the Evaluation and Analysis Groups of 
the Research and Development Division of the Bureau of Ordnance, Navy 
Department, has been named Chief Mathematician, Bureau of Ordnance. 

Mr. Roy A. Chapman, who has been Silviculturist at the Hitchiti Exper- 
imental Forest, Round Oak, Georgia, is now with the U.S. Forest Service in 
Washington, D. C. 

Dr. Way Ming Chen has been appointed to an Instructorship in mathematics 
at Brown University. 

Dr. John M. Clarkson has been promoted to a professorship at North Carolina 
State College. 

Mr. S. Lee Crump has been promoted to an Assistant Professorship at Iowa 
State College. 

Dr. Joseph F. Daly, formerly an Instructor at Catholic University, and more 
recently a Lieutenant in the Navy Department, is now Statistician with the 
Bureau of the Census. 

Dr. Daniel B. DeLury has been promoted to a professorship in statistics at 
Virginia Polytechnic Institute. 

Dr. Acheson J. Duncan has been appointed to an associate professorship of 
political economy at The Johns Hopkins University. 

Dr. Jack W. Dunlap, formerly at Rochester University and more recently a 
Lieutenant Commander in the U. 8. Navy, is now Director of the Division of 
Biomechanics of the Psychological Corporation. 

Mr. Francis B. Elmore has been discharged from the Army and is Quality 
Control Engineer at the Union Bag and Paper Company, in Savannah, Ga. 

Mr. Mark W. Eudey has returned from service to his former position with the 
Statistical Laboratcry at the University of California. 
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Mr. Charles D. Ferris, formerly at Aberdeen Proving Ground, is now Quality 
Control Engineer with the General Electric Company, in Bridgeport, Conn. 

Mr. Lester R. Frankel is now a statistician with Dun and Bradstreet. 

Mr. John E. Freund has accepted a position as assistant professor of mathe- 
matics at Alfred University. 

Dr. Bernard Friedman has been promoted to an assistant professorship at 
New York University. 

Dr. Milton Friedman has been appointed to an associate professorship in the 
Department of Economics at the University of Chicago. 

Mr. G. Rupert Gause is now with the Technical Staff of the Bell Telephone 
Laboratories. 

Professor Edwin L. Godfrey has been appointed Head of the Department of 
Mathematics and Astronomy at Defiance College. 

Dr. Casper Goffman has been appointed to an assistant professorship in the 
Department of Mathematics at the University of Kentucky. 

Mr. Harry H. Goode is now a Mathematician in the Office of Research and 
Inventions, U. 8. Navy. 

Mr. Robert D. Gordon is a Teaching Assistant in Mathematics at Indiana 
University. 

Mr. Pert A. Gottfried has returned from the service and is Research Analyst 
with Dun and Bradstreet. 

Dr. Clyde H. Graves, formerly at Pennsylvania State College, is now Opera- 
tions Branch Chief of the Office of Price Board Management, OPA. 

Dr. Joseph A. Greenwood has recently been separated from active duty with 
the Navy and is now a statistician in the Bureau of Aeronautics. 

Mr. Harris T. Guard has returned to Colorado A. and M. as an Instructor in 
the Department of Mathematics. 

Dr. Joy P. Guilford has returned to his former position as Professor of Psychol- 
ogy at the University of Southern California. 

Prof. Emil J. Gumbel, formerly with the New School of Social Research, 
has been appointed to a Special Lectureship in Statistics at Newark College of 
Engineering. 

Mr. Lee S. Gunlogson has been discharged from the Navy and is now in the 
statistical department of the Lumbermens Mutual Casualty Company, Chicago. 

Dr. Paul R. Halmos has been appointed to an assistant professorship in mathe- 
matics at the University of Chicago. 

Professor Preston C. Hammer has returned to his former position at Oregon 
State College. 

Mr. Joseph O. Harrison, Jr. is now employed as a mathematician for the 
Harvard University Automatic Sequence Controlled Calculator Project in 
Cruft Laboratory. 

Mr. Millard Hastay, formerly with the Statistical Reseangh Group at Co- 
lumbia University, is now Research Associate at the National Bureau of Eco- 
nomic Research. 
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Mr. Bernard Hecht has been promoted from Chief Quality Control Engineer 
to Manager of the Quality Control Department of the International Resistance 
Company, Philadelphia. 

Mr. Joseph L. Hodges, Jr. has been appointed to a teaching assistantship in 
mathematics at the University of California. 

Dr. Paul G. Hoel has been promoted to an associate professorship in mathe- 
matics at the University of California at Los Angeles. 

Mr. Richard A. Hornseth has been appointed to an instructorship in the De- 
partment of Sociology and Anthropology at the University of Wisconsin. 

Mr. Harry M. Hughes has been appointed to a teaching assistantship at the 
University of California. 

Mr. Leonid Hurwicz, formerly with the Cowles Commission, has been ap- 
pointed to an associate professorship at Iowa State College. 

Mr. Joseph B. Jeming has been separated from service with the Air Forces, 
and is now a Financial and Economic Consultant in New York City. 

Mr. Paul Johner has been discharged from the Army and is now in the Indus- 
trial Engineering Division of the Aluminum Company of America, New Kens- 
ington, Pa. 

Miss Margaret Kampschaefer, who is a statistician in the War Department, is 
now serving in the Supply Division of the Air Force Service Command in Erlan- 
gen, Germany. 

Dr. Leo Katz has been appointed to an assistant professorship at Michigan 
State College. 

Mr. Frederick G. King has been discharged from the Army and is now a 
civilian instructor in the Anti-Aireraft Artillerv School at Fort Bliss. 

Dr. Tjalling Koopmans has been appointed Associate Professor of Economies 
at the University of Chicago. 

Mr. Paul J. Kopp has been discharged from the Army and is now with the 
Patent Department of the Gulf Oil Corporation, Washington, D. C. 

Dr. Carl F. Kossack has accepted a position as mathematician with the 
Joint Army-Navy Air Intelligence in the Strategic Vulnerability Branch. 

Dr. Waclaw Kozakiewicz has been promoted to an assistant professorship 
in mathematics at the University of Saskatchewan. 

Professor Rafael Laguardia has returned to Uruguay as Director of the In- 
stitute de Matematica y Estadistica, Facultad de Ingenieria. 

Dr. Charles R. Langmuir, formerly with the Psychological Corporation, is 
now Secretary-Treasurer and Lab. Director of the Bennett and Langmuir 
Development Corporation, Mamaroneck, N. Y. 

Mr. Charles M. Larson has accepted a position as mathematician with the 
Pacific Mutual Life Insurance Company, Los Angeles. 

Miss Lucy A. LaSala, formerly with the research group at Columbia Uni- 
versity, is now @eacher of mathematics at East New York Vocational High 
School. 

Dr. Richard A. Leibler is now a Member of the Institute for Advanced Study, 
Princeton. 
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Miss Grace L. Lesser, formerly with the research group at Columbia Uni- 
versity, is now employed as a statistician with the Econometric Institute. 

Miss Myra Levine has accepted a position as statistician with the Socony- 
Vacuum Oil Company, in New York City. 

Dr. Jerome C. R. Li has been appointed to an instructorship at Oregon State 
College. 

Professor William T. Martin has accepted a professorship in the Department 
of Mathematics at Massachusetts Institute of Technology. 

Miss Ethelyne L. McBee, formerly with the U. 8. Department of Agriculture, 
is now teaching science and mathematics at the Falls Church High School, 
falls Church, Virginia. 

Dr. Paul W. McGann has been appointed to an assistant professorship in 
economics at American University. 

Dr. Max F. Millikan has been appointed to a research associateship at Yale 
University. 

Mr. Probodh C. Mittra has accepted a position as consulting statistician with 
the United Nations Economic and Social Council. 

Dr. Marjorie FE. Moore has transferred from her position as statistician with 
the Social Security Administration, to one as Program Analyst in the Office of 
Vocational Rehabilitation, Federal Security Agency. 

Miss Judith Moss, who was with the research group at Columbia University, 
is now research assistant with the National Bureau of Economic Research. 

Dr. Frederick Mosteller has been appointed to a lectureship and research 
associateship in the Department of Social Relations at Harvard University. 

Mr. James E. Myers, formerly with the Naval Research Laboratory at Ana- 
costia Station, is now with the research group of the Moore School of Electrical 
Engineering, University of Pennsylvania. 

Mr. Stanley W. Nash is a graduate student this year at the University of 
California. 

Professor J. Neyman is on leave from his position at the University of Cali- 
fornia for the fall semester, and is Visiting Professor of Mathematical Statistics 
at Columbia University. 

Mr. Russell T. Nichols has been discharged from the Army, and is a graduate 
student at the University of Chicago. 

Mr. Harold Nisselson has been discharged from the Navy, and is now a statis- 
tician in the Bureau of the Census, where he was formerly employed. 

Professor Nilan Norris has been separated from his service with the Army, 
with the rank of Major, and has returned to his position in the Department of 
Economics at Hunter College. 

Dr. Guy H. Oreutt, formerly at Massachusetts Institute of Technology, has 
accepted a research position in the Department of Applied Economics, Cam- 
bridge University. This new department is to be modelled somewhat along the 
lines of the Cowles Commission at the University of Chicago, and is to be under 
the direction of Dr. J. R. N. Stone. 








510 ' NEWS AND NOTICES 


Mr. Warren H. Page has been separated from service with the Army, and is 
now a graduate student at Columbia University. 

Mr. Nicholas Pastore has been appointed to an instructorship at Union Junior 
College, Cranford, New Jersey. 

Mr. I. B. Perrott has been demobilized from the British Army with the rank 
of Major. 

Mr. George W. Petrie, II] has accepted a position as Special Engineer with 
the Bethelehem Steel Company. 

Dr. Harry 8S. Pollard has been promoted to a professorship at Miami Uni- 
versity. 

Dr. G. Baley Price, Professor of Mathematics at the University of Kansas, 
has been awarded a Post-Service Guggenheim Fellowship, beginning September 
1, 1946. 

Mr. Robert J. Randall has been discharged from the Army and is now a 
graduate student at Columbia University. 

Professor Lowell J. Reed, of the School of Hygiene and Public Health, The 
Johns Hopkins University, has been appointed Vice-President of the University. 

Dr. Francis Regan has been promoted to a professorship at St. Louis Uni- 
versity. 

Mrs. Kathryn B. Rolfe, formerly at the University of California at Berkeley, 
has accepted a position as associate in mathematics at the University of Cali- 
fornia College of Agriculture, at Davis. 

Mr. Frank Saidel is a graduate student in mathematical statistics this vear at 
Columbia University. 

Dr. Leonard J. Savage has been awarded a Special Rockefeller Fellowship, 
beginning September 1946. 

Professor Henry Scheffé of the University of California at Los Angeles has 
been awarded a Guggenheim Fellowship, and is spending the vear at the Uni- 
versity of California at Berkeley. 

Professor Andrew 8S. Schultz, Jr. has been separated from service with the 
Army and has returned to Cornell University with the rank of associate pro- 
fessor. 

Dr. Saul B. Sells, formerly with the OPA, has accepted a position as Assistant 
to the President of the A. B. Frank Company, San Antonio. 

Mr. Lawrence W. Shaw is now a statistician with the U. 8. Public Health 
Service in Bethesda. 

Dr. Ronald W. Shephard has been appointed to a lectureship at the Uni- 
versity of California, Berkeley. 

Mr. Clifford R. Simms has accepted a position as manager of the Cleveland 
office of the B. E. Wyatt Company. 

Mr. George B. Simon has been separated from Army service with the rank of 
major, and has accepted a civilian position as chief of the Analysis and Research 
Unit, Psychological Section, Office of Surgeon, Barksdale Field. 

Mr. Herbert Solomon has been appointed to an instructorship at the College 
of the City of New York. 

Mr. Melvin D. Springer has returned to the University of Illinois and has been 
appointed to an assistantship. 
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Mr. Andrew P. Stergion has been discharged from the Army, and is now Sta- 
tistical and Quality Control Engineer with the Corning Glass Works. 

Mr. Milton 8. Stevens has been discharged from the Navy, and has accepted 
a position as Director of Special Projects with Time, Ine. 

Dr. George J. Stigler has been appointed to a professorship in economics at 
Brown University. 

Mr. Alexander L. Stott has been discharged from the Navy, and is now a staff 
assistant in the Treasury Department of the American Telephone and Tele- 
graph Company. 

Dr. L. V. Toralballa has accepted a teaching position at Fordham University. 

Dr. Walter R. Van Voorhis has returned to Fenn College, with the rank of 
associate professor. 

Mr. Edward H. Van Winkle has been appointed to a professorship of business 
statistics at Rensselaer Polvtechnic Institute. 

Dr. Charles W. Vickery has been appointed to an associate professorship at 
Ohio State University. 

Mr. David F. Votaw, Jr. has been separated from service with the Navy and 
has returned to Princeton University as Research Associate. 

Mr. W. Allen Wallis has been appointed to a professorship at the University 
of Chicago. 

Mr. Ralph EF. Wareham is now managing director of the National Photocolor 
Corporation. 

Dr. Jacob Wolfowitz has been appointed to an associate professorship in 
mathematical statistics at Columbia University. 

Mr. John F. Wyckoff, formerly at Trinity College, has accepted a position in 
the Research Division of the Actuarial Department, Connecticut General Life 
Insurance Company, Hartford. 

Mr. Earl K. Yost, Jr. has been appointed to a graduate assistantship in 
mathematics at the University of Oregon. 


Ce 


A conference on applied mathematical statistics was held at Lake Junaluska, 
North Carolina, August 4-9, 1946 under the sponsorship of the Institute of 
Statistics of the University of North Carolina. The following individuals at- 
tended the conference: C. I. Bliss, W. G. Cochran, Gertrude M. Cox, D. B. 
Dunean, C. Eisenhart, R. A. Fisher, Carl F. Kossack, Frederick Mosteller, 
H. W. Norton, Paul Peach, Charles F. Roos, Walter A. Shewhart, Frederick 
Stephan, Gerhard Tintner, John W. Tukey, 8. 8. Wilks, C. P. Winsor, and J. 
Wolfowitz. 


(RI em 


Newark College of Engineering is sponsoring a series of conferences on In- 
dustrial Statistics. The first of these, on Acceptance Sampling, began on Sep- 
tember 27 and ran for eleven four-hour Friday sessions. Among the members 
of the Advisory Panel on Industrial Statistics are Institute members 8. B. 
Littauer, A. I. Peterson, W. A. Shewhart, and S. 8S. Wilks. 
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New Members 


The following persons have been elected to membership in the Institute: 

Anscombe, F. J. Rothamsted Experimental Station, Harpenden, Herts, Eng. 

Back, Kurt W., M.A. (California at L.A.) Stat., Surveillance Branch, Ballistic Res. Lab., 
Aberdeen Proving Gd., Md. 

Bresnahan, Maurice F. Stat., U.S. Bur. of Labor Statistics, Wash., D. C., Apt. 305, 1015 
N St., N.W., Wash. 1 

Chung, Kai-Lai, M.A. (Princeton) Graduate Coll., Princeton Univ., Princeton, N. J. 

Clarke, P.C. Asst. Gen. Mgr., Hunter Pressed Steel Co., Lansdale, Pa., Line Lexington, 
Pa. 

Coon, Helen J., M.A. (Southern Methodist) Ballistic Res. Lab., Aberdeen Proving Gd., Md. 

Copp, Warren F., B.S. (Ohio State) Supv., Quality Control Dept., Wheeling Steel Corp., 
Yorkville Works, Yorkville, Ohio 

Divatia, Vasishtha V., B.Sc. (Bombay) Student in Math. Stat., Columbia Univ. #724 
John Jay Hall, Columbia Univ., N. Y. City. 

Fanshaw, Hugh L., M.S. (Manitoba) Standards Supv., Canadian Indus. Ltd., General 
Chemicals Div., Hamilton, Ont., Can., 120 St. Clair Ave. 

Feriet, Kampe de, Dr. Sci. (Paris) Professeur a la Faculte des Sci. de l’Universite de Lille, 
16 rue des Jardins, Lille, France 

Fine, Clarence B., B.S.S. (C.C.N.Y.) Economist, OPA, Wash., D. C., 1388 Tuckerman St., 
N.W., Wash. 11 

Golub, Abraham, B.A. (Brooklyn) Math., Ballistic Res. Lab., Aberdeen Proving Gd., 
Md., Men’s Dorm 

Gomberg, William, Ph.D. (Columbia) Dir. of Mgt., Engr. Dept., International Ladies 
Garment Workers Union, 1710 Broadway, N. Y., N. Y., 444 Beach 142nd St., Neponsit, 
bt. 

Halton, Frederick J., Jr. Asst. to Pres., John Deere & Co., 230 S. Clark St., Chicago, IIl., 
1314 Wesiview Rd., Highland Park 

Hanson, Robert H., M.S. (Iowa) Stat., Bur. of the Census, Wash., D. C., 3/48 Westover 
Dr., Wash. 20 

Hardy, Philip H., B.S. (Rice) Quality Engr., General Elec. Co.; on leave, Cpl. US Army, 
4000 BU §8q. S8., Wright Field, Dayton, Ohio 

Hasty, Willis L., Jr., B.C.S. (Benjamin Franklin) Capt., Signal Corps, 3475 South Wake- 
field St., Arlington, Va. 

Hess, Ida I., A.B. (Indiana) Stat., Population Div., Bur. of the Census, Wash., D. C., 
1425 Rhode Island Ave., N.W. 

Jacobson, Jack J., M.B.A. (Chicago) Stat., Spiegel, Inc., Chicago, Ill., 3562 W. Palmer St. 

Janko, Prof. Jaroslav, Technical Univ., Prague, Czechoslovakia, Na bojisti 3, Praha II 

Karp, Abraham E., M.S. (C.C.N.Y.) Stat., Aberdeen Proving Gd., Md., 25 Aberdeen Ave. 

Keefe, David P., B.S. (St. Thomas) Supv., Raw Material Testing, Minn. Mining and Mfg. 
Co., St. Paul 6, Minn., 590 Holly Ave., St. Paul 

Kellogg, Lester S., M.A. (Northwestern) Chief, Prices and Cost of Living Branch, Bur. 
of Labor Statistics, U.S. Dept. of Labor, Wash., D. C., 404 Shady Lane, Falls Church, 
Va. 

Kindig, Fred E., B.S. (Pennsylvania State) Industrial Math., Westinghouse Elec. Co., 
Braddock Ave., E. Pittsburgh, Pa., 53 Nantucket Dr., R.D.6, Pleasant Hills, Pitts- 
burgh 10 

Kosambi, D. D., ScB. (Harvard) Dept. of Math., Tata Inst. of Fundamental Res., 53 
Pedder Rd., Bombay, India 

Kruskal, William, M.S. (Harvard) 150 W. 30th St., N. Y., N.Y. 

Lacey, Prof. Oliver L., Ph.D. (Cornell) Head, Dept. of Psychology, Univ. of Ala., Uni- 
versity, Ala. 
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Martin, Cyrus A., B.S. (Iowa State) Stat., Headquarters Fifth Army, 7817 South Shore 
Dr., Chicago 49, Ill. 

Maxwell, Pat, Jr., B.S. (Fordham) Stat., Surveillance Br., Ballistic Res. Lab., Aberdeen 
Proving Gd., Md., Apt. A6-3, Baldwin Manor 

Mills, Vicente, P.O. Box 2090, Manila, Philippine Islands 

Newman, Doris, M.A. (Michigan) Instr. in Stat., School of Bus. Adm., Crosby Hall, 
Univ. of Buffalo, Buffalo 14, N. Y. 

Ott, Asso. Prof. Ellis R., Ph.D. (Illinois) Dept. of Math., Rutgers Univ., New Brunswick, 
N.J., 199 Sterling Dr., Orange 

Paull, Allan E., B.A. (Manitoba) 114 Mangum, U. of N. C., Chapel Hill, N. C. 

Pollard, A. H. The Mutual Life & Citizens’ Ass. Co. Ltd., Martin Place and Castlereath 
St., Sydney, N.S.W., Australia 

Quenouille, Maurice H., B.A. (Cambridge) Stat., Rothamsted Experimental Station, 
Harpenden, Herts, Eng., 8 Chipstead St., Fulham, London, S.W. 6 

Ryan, Asso. Prof. Thomas A., Ph.D. (Cornell) Dept. of Psychology, Morrill Hall, Cornell 
Univ., Ithaca, N. Y 

Sachs, Rose, B.A. (Goucher) Stat., 1428 R St., N.W., Wash. 9, D.C. 

Sandelius, D. Martin, Fil.kand. (Stockholm) Baltzar von Platensg. 5, VI, Stockholm, 
Sweden 

Schoenbaum, Prof. Emil, Ph.D. (Prague) Dept. of Applied Math., School of Natural 
Science, Univ. of Charles IV, Prague, Czechoslovakia, Praha XIX, Terronska 26 

Schweitzer, Morton D., Ph.D. (Columbia) Stat. and econ. consultant, 60 West 10th St., 
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Shybekay, Derso S., Ph.D. (Budapest) Pres., Indus. Res. Council, P. O. Box 324, Minne- 
apolis 1, Minn. ° 

Stein, Irving, B.S. (Mass. Inst. Tech.) Qual. Control Engr., Polaroid, Inc., 10 Forest Park 
Ave., Adams, Mass. 

Stephens, William H., M.Sc. (Queen’s) British Commonwealth Scientific Office, 1785 
Massachusetts Ave., Wash., D. C., 3807 Military Rd., N.W. 

Strieby, J. Glenn, B.S. (Iowa Wesleyan) Tech. Dept., Kimberly-Clark Corp., Neenah, 
Wis., 616 E. Circle St., Appleton 

Thompson, Juanita, Qual. Control Inspector, 4117 West 25th St., Chicago 23, Ill. 

Thompson, Sidney L., M.S. (Tulane) Ala. Polytechnic Inst., Auburn, Ala. 

Tompkins, Haldan M. Head, Works Control Lab., National Carbon Co., Fostoria, Ohio, 
P.O. Drawer 191 

Tosti, Carlo R., M.A. (Stanford) Capt., AC, Adm. Engr., Tech. Control Officer, Wright 
Field, Dayton, Ohio, 501 Forest Ave., Dayton 5 

Toulouse, Julian H., Ph.D. (Iowa State) Chief Engr., Quality and Specifications Sec., 
Owens-Ill. Glass Co., Toledo, Ohio, P. O. Box 1035-1036, Toledo 1 

Truksa, Ladislav, Ph.D. (Charles Univ.) Dir., Lecturer in Math. Stat., Charles Univ., 
Praha XI-1800, Czechoslovakia 

Urie, Frank -D., A.B. (Michigan) Supt. of Inspection, Elgin National Watch Co., Elgin, 
Ill. 

Vajda, Stefan, Ph.D. (Vienna) Stat., Admiralty, Whitehall, London, Eng., 54 Chapel 
Way, Epsom, Surrey 

Weiss, Lionel, M.A. (Columbia) Gottsberger Fellow, Columbia Univ., N. Y., N. Y., 
2205 Davidson Ave., N. Y. 53 

Weyl, Eric, (Cologne) Textile Consultant, Chicopee Mfg. Corp., Manchester, N. H., 279 
Orange St. 











REPORT ON THE ITHACA MEETING OF THE INSTITUTE 


The Ninth Summer Meeting of the Institute of Mathematical Statistics was 
held at Cornell University, Ithaca, New York, on Thursday, August 21, and 
Saturday, August 23, 1946. The meeting was held in conjunction with the 
summer meetings of the American Mathematical Society and the Mathematical 
Association of America. The following 71 members of the Institute attended 
the meeting: 


P. L. Alger, C. B. Allendoerfer, T. W. Anderson, Jr., J. L. Barnes, E. E. Blanche, Paul 
Boschan, A. H. Bowker, A. E. Brandt, R. 8. Burington, W. G. Cochran, E. P. Coleman: 
H. B. Curry, J. H. Curtiss, J. L. Doob, J. Dutka, P. S. Dwyer, B. Epstein, Will Feller, C- 
D. Ferris, R. M. Foster, J. E. Freund, M. A. Girschick, A. A. Goodman, Louis Guttman, 
W. W. Gutzman, P. R. Halmos, T. E. Harris, Bertha I. Hart, E. H. C. Hildebrandt, P. G. 
Hoel, R. H. Hoskins, Harold Hotelling, W. W. Jacobs, T. J. Jaramillo, Evan Johnson, Jr., 
H. L. Jones, Mark Kac, Irving Kaplansky, Leo Kart, Tjalling Koopmans, C. F. Kossack, 
M. M. Lavin, Walter Leighton, Jr., Howard Levene, M.S. Macphail, J. W. Mauchly, P. J. 
McCarthy, E. C. Molina, Margaret E. Moore, J. E. Morton, L. F. Nanni, P. M. Neurath, 
E. G. Olds, G. B. Price, C. J. Rees, Selby Robinson, Herman Rubin, P. J. Rulon, Arthur 
Sard, F. E. Satterthwaite, I. E. Segal, G. R. Seth, Andrew Sobezyk, Herbert Solomon, C.M. 
Stein, F. F. Stephan, A. P. Stergion, A. W. Tucker, J. W. Tukey, J. L. Ullman, Abraham 
Wald, 8. 8. Wilks. 


The first session, a jeint session with the American Mathematical Society, 
was held on Thursday morning, and was devoted to contributed papers. Pro- 
fessor W. G. Cochran, President of the Institute, presided. The following 
seven papers were presented: 


1. A Test of Randomness in Two Dimensions. 
Mr. Howard Levene, Columbia University. 

. Asymptotic Distribution of Moments from a System of Linear Stochastic Difference 
Equations. 


i] 


Mr. Herman Rubin, Cowles Commission for Research in Economics. 
3. Conditional Expectation and Unbiased Sequential Estimation. 
Professor David Blackwell, Howard University. 
4. A Discussion of the Ehrenfest Model. Preliminary report. 
Professor Mark Kac, Cornell University. 
5. Sampling from Contaminated Distributions. Preliminary report. 
Professor John W. Tukey, Princeton University. 
6. On the Class of Functions Defined by the Difference Equation (x + 1) f(x + 1) = 
(a + bz) f(z). . 
Dr. Leo Katz, Wayne University. 
7. Retention of Decimal Places in Matrix Calculations. 
Dr. Franklin E. Satterthwaite, Aetna Life Insurance Company. 
The following four papers were presented by title: 
8. The Efficiency of the Mean Moving Range. 
Professor Paul G. Hoel, University of California at Los Angeles. 
9. Some Basic Theorems for Developing Tests of Fit for the Case of the Non-Parametric 
Probability Distribution Function. 
Mr. Bradford F. Kimball, N. ¥. State Department of Public Service, New York 
City. 
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10. Confidence Limits for the Fraction of a Normal Population Which Lies Between Two 
Given Limits. 


Professor Jacob Wolfowitz, Columbia University. 
11. The Consolidated Doolittle Technique. 
Dr. Paul Boschan, The Econometric Institute, Inc. 


Abstracts of all these papers appear elsewhere in this issue of the Annals. 

At two o’clock on Thursday afternoon there was a joint session with the Ameri- 
can Mathematical Society which featured the invited address of Professor J. 
L. Doob of the University of Illinois on Probability in Function Space. This 
address was followed by a business meeting of the Institute which featured 
reports by the President, the Secretary-Treasurer, the Editor, and Professor 
Feller, who spoke for the recently created committee on the distribution of the 
Annals in the war areas. 

On Thursday evening there was a joint dinner with the American Mathemati- 
cal Society and the Mathematical Association of America. 

The meeting closed with a session on Friday morning devoted to the topic, 
Multivariate Analysis for Non-Experimental Data. Professor Will Feller, of 
Cornell University, presided. Professor T. Koopmans, of the Cowles Com- 
mission for Research in Economics, presented a paper entitled Statistical Infer- 
ence in Dynamic Economic Models. Dr. T. W. Andegson, Jr. presented a paper 
written by himself and Mr. Herman Rubin entitled Estimation of Structural 
Equations through Linear Transformation of Regression Coefficients. The meet- 
ing concluded with a discussion of these papers. 

P.S. Dwyer, 
Secretary. 











REPORT OF THE PRINCETON MEETING OF THE INSTITUTE 


The twenty-third meeting of the Institute of Mathematical Statistics was 
held in Princeton, New Jersey on Friday, November 1, 1946, in connection with 
the year-long Celebration of the Bicentennial of Princeton University. The 
meeting was devoted entirely to Analysis of Variance. The meeting was at- 
tended by 118 persons including the following 96 members of the Institute: 


Adam Abruzzi, Forman 8. Acton, R. L. Anderson, T. W. Anderson, Jr., M. 8. Bartlett, 
Robert Bechofer, Gilbert W. Beebe, J. H. Bigelow, Archie Blake, C. I. Bliss, A. E. Brandt, 
Burton H. Camp, George C. Campbell, A. George Carlton, Kai Lai Chung, W. G. Cochran, 
Gertrude Cox, Harold Cramér, 8. Lee Crump, J. H. Curtiss, Joseph F. Daly, Besse B. Day, 
D. B. DeLury, V. V. Divatia, J. Dutka, Churchill Eisenhart, B. Epstein, H. L. Fanshaw, 
Nicholas Fattu, Will Feller, Merrill M. Flood, Bernard Friedman, Hilda Geiringer, H. H. 
Goldstine, Joseph A. Greenwood, E. J. Gumbel, Margaret Gurney, L. Gutmann, T. E. 
Harris, Millard Hastay, Irwin 8S. Hoffer, C. J. Kirchen, B. F. Kimball, Lila F. Knudsen, 
H. 8S. Konijn, Jack Laderman, J. D. Maddrill, Sophie Marcuse, H. C. Mathisen, J. W. 
Mauchly, Margaret Merrell, Elmer B. Mode, Margaret E. Moore, J. E. Morton, Judith 
Moss, F. Mosteller, Charles M. Mottley. Ray B. Murphy, P. M. Neurath, Hugo Nilson, 
Gottfried E. Noether, Monroe L. Norden, H. W. Norton, C. O. Oakley, P. S. Olmstead, 
J.G. Osborne, Ellis R. Ott, C, J. Rees, W. A. Reynolds, A. C. Rosander, David Rosenblatt, 
Ernest Rubin, P. U. Rulon, Frank Saidel, Marian M. Sandomire, Walter A. Shewhart, 
James G. Smith, Milton Sobel, Herbert Solomon, Mortiner Spiegelmon, Charles M. Stein, 
G. R. Stibitz, John R. Tomlinson, Marion M. Torrey, John W. Tukey, D. F. Votaw, Jr., 
F. M. Wadley, Alton J. Wadman, A. Wald, Robert M. Walter, Lionel Weiss, Frank Wil- 
coxon, 8. 8. Wilks, C. P. Winsor, J. Wolfowitz, and W. J. Youden. 


At the morning session the following program was presented with Professor 


S.S. Wilks of Princeton University as chairman: 
Topic: Mathematical Approaches to the Analysis of Variance 
Papers: Two Probability Models for the Analysis of Variance 


Professor A. Wald, Columbia University 
Applications of Analysis of Variance 
Professor M. 8. Bartlett, Cambridge University and The University of 
North Carolina 
Discussion: Professor S. L. Crump, Iowa State College 
Dr. J. F. Daly, Bureau of the Census 
Professor J. W. Tukey, Princeton University 
Professor C. P. Winsor, Johns Hopkins University 
Professor J. Wolfowitz, Columbia University 


The program for the afternoon session, under the chairmanship of Professor 
Will Feller of Cornell University, was as follows: 


Topic: Multivariate Problems in the Analysis of Variance 


Papers: Analysis of Covariance 
Professor W. G. Cochran, The University of North Carolina 
Vector Methods 
Professor J. W. Tukey, Princeton University 
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Discussion: Professor T. W. Anderson, Columbia University 
Professor C. I. Bliss, Yale University 
Professor Harold Cramér, The University of Stockholm and Princeton 
University 
Professor D. B. DeLury, Virginia Polytechnic Institute 
Professor P. L. Hsu, The University of North Carolina 


The evening session consisted of round table discussion on Unsolved Problems 
of the Analysis of Variance, with Professor Gertrude M. Cox as chairman. 

Members of the Institute and others who attended the meeting were guests 
of the Institute for Advanced Study at tea in Fuld Hall from 4to 6 P.M. Those 
attending the evening session were guests of Princeton members’‘of the Institute 
for refreshments in Fine Hall from 10 to 11 P.M. 

P. 8. Dwyer, 
Secretary. 











MEMBERS OF THE INSTITUTE OF MATHEMATICAL STATISTICS* 


(As of October 1, 1946) 
(The names of Fellows of the Institute are designated by * and Life Members by f) 


*Abbey, Helen M.A. (Michigan) Stat., Bur. of Records and Stat., Mich. Dept. of Health, 
Lansing, Mich., 916 N. Chestnut 

Acerboni, Prof. Argentino V. Dr.Ec. (Buenos Aires) Facultad de C. Economicas, Buenos 
Aires, Argentina, Larroque 232, Banfield, Argentina 

Action, Forman Ch.E. (Princeton) Grad. Student, Carnegie Inst. of Tech., Pittsburgh, 
Pa., 101 W. Broadway, Salem, N. J. 

Aitchison, Beatrice Ph.D. (Johns Hopkins) Econ. and Stat. Analyst, Interstate Com- 
merce Comm., Wash. 25, D. C., 1929 S St., N.W., Wash. 9 

Alchian, Asst. Prof. Armen A. Ph.D. (Stanford) Econ. Dept., Univ. of Calif., Los 
Angeles, Calif. 
Alger, Philip L. M.S. (Union) Staff Asst. to Mgr. of Engr. Apparatus Dept., General 
Elec. Co., 1 River Rd., Schenectady, N. Y., 1758 Wendell Ave., Schenectady 8 
Allen, Prof. Roy G.D. D.Sc. (London) London School of Econ., Houghton St., Aldwych, 
London, W.C. 2, Eng., 11 Christchurch Pl., Epsom, Surrey 

Ailendoerfer, Prof. Carl B. Ph.D. (Princeton) Haverford Coll., Haverford, Pa., 750 
Rugby Rd., Bryn Mawr’ 

Alt, Franz L. Ph.D. (Vienna) Asst. Dir. of Res., Econometric Inst., 500 Fifth Ave., 
N. Y. 18, N. Y., 271 Fort Washington Ave., N. Y. 32 

Alter, Dinsmore Ph.D. (California) Dir., Griffith Observatory, Los Angeles, Calif. 

Anderson, Paul H. Ph.D. (Illinois) Economist, War Assets Adm., Wash., D. C., 1228 
Blair Mill Rd., Silver Spring, Md. 

Anderson, Asso. Prof. Richard L. Ph.D. (Iowa State) Inst. of Stat., N. C. State Coll., 
Raleigh, N. C., Box 5494, State Coll. Station 

Anderson, Theodore W., Jr. Ph.D. (Princeton) Instr., Dept. of Math. Stat., Columbia 
Univ., N. Y. 27, N. Y., 2435 Lawndale, Evanston, Ill. 

Angell, Dorothy T. Stat. Analyst, Bell Tel. Labs., Murray Hill, N. J. 

Anscombe, F. J. Rothamsted Experimental Station, Harpenden, Herts, Eng. 

Arias, B. Jorge C.E. (Guatemala) 3 Avenida Sur 65, Guatemala City, Guatemala, C. A. 

Arnold, Prof. Herbert E. Ph.D. (Yale) Wesleyan Univ., Middletown, Conn., 167 High 
St. 

Arnold, Asst. Prof. Kenneth J. Ph.D. (Mass. Inst. Tech.) Dept. of Math., Univ. of 
Wis., Madison 6, Wis., 733 E. Johnson St., Madison 3 

Aroian, Leo A. Ph.D. (Michigan) Instr., Hunter Coll., N. Y., N. Y., 247 Wadsworth 
Ave., N. Y. 88 

Arrow, Kenneth J. M.A. (Columbia) Lydig Fellow, Columbia Univ., N. Y. 27, N. Y., . 
749 West End Ave., N. Y. 25 

Astrachan, Asso. Prof. Max Ph.D. (Brown) Chm. of Math. Dept., Antioch Coll., 
Yellow Springs, Ohio, 204 E. Whiteman St. 

Auner, George J. B.A. (Western Reserve) Stat., Market Res. Dept., The Weatherhead 
Co., E. 131 St., Cleveland 8, Ohio, 12420 Iowa Ave., Cleveland 8 


* Members were asked to supply fresh information for this Directory. The name is 
followed by highest degree and Institution granting it. Then follow the professional and 
business connections of the member, with business address, and finally (in italics) the home 
or mail address. When an address is known to be in error it is followed by (last address). 
Changes in addresses or errors in names, titles, or addresses, should be reported to the 
Secretary. 
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Bachelor, Robert W. M.B.A. (Washington) Dir. of the Res. Council, American Bankers 
Assn., 12 E. 36th St., N. Y. 16, N. Y., 7 Jane Ave., Hartsdale 

Back, Kurt W. M.A. (California at L. A.) Stat., Surveillance Branch, Ballistic Res. 
Lab., Aberdeen Proving Gd., Md. 

Bacon, Asso. Prof. Harold M. Ph.D. (Stanford) Stanford Univ., Stanford, Calif., Box 
1144 

Baer, Prof. Reinhold Ph.D. (Géttingen) Dept. of Math., Univ. of Ill., Urbana, Ill. 

Bailey, Arthur L. B.S. (Michigan) Stat., American Mutual Alliance, 60 E. 42nd St., 
N. Y., N. Y., P.O. Box 278, Ramsey N.J. 

*Baker, Asst. Prof. George A. Ph.D. (Illinois) Asst. Prof. of Math. and Asst. Stat., 
Exp. Sta., Coll. of Agri., Univ. of Calif., Davis, Calif. 

Bal, Kenan Y. (Columbia) 500 Riverside Dr., N. Y. 27, N. Y. 

Baldwin, Woodson W. S.B. (Mass. Inst. Tech.) c/o Van Hook, 2812 Arline Ave., Ar- 

tesia, Calif. 
Bales, R. P. B.A.Sc. (Toronto) Tech. Supt., Dominion Rubber Co., St. Jerome, Que., 
Can. 
Bancroft, Asso. Prof. Theodore A. Ph.D. (Iowa State) Univ. of Ga., Athens, Ga. 
Barkan, Herbert M.A. (Columbia) Instr., Newark Coll. of Engineering, Newark, N. J., 
§ E. £0th St., Brooklyn, N. Y. 
Barnes, Jarvis M.A. (George Peabody Coll. for Teachers) Atlanta Board of Educ., 
14th Floor, City Hall, Atlanta, Ga. 
Barnes, Prof. John L. Ph.D. (Princeton) Chm., Dept. of Applied Math., Tufts Coll., 
Medford 55, Mass., 16 Ardley Rd., Winchester 
Barr, Prof. Arvil S. Ph.D. (Wisconsin) Univ. of Wis., Madison, Wis. 
Barral-Souto, Prof. Jose Sc.D. (Buenos Aires) Univ. of Buenos Aires, Buenos Aires, 
Argentina, Cordoba 1459 
*Bartky, Dean Walter Ph.D. (Chicago) Dean of Div. of Phys. Sciences, Univ. of Chicago, 
Chicago, Ill. 
*Bartlett, Prof. Maurice S. D.Sc. (London) Univ. of Cambridge, Cambridge, Eng., 137 
Chesterton Rd. (On leave, Inst. of State., Univ. of N. C., Chapel Hill, N. C.) 
*Baten, Prof. William D. Ph.D. (Michigan) Dept. of Math., Mich. State Coll., East 
Lansing, Mich., 411 Marshall 

Bates, Prof. O. Kenneth Sc.D. (Mass. Inst. Tech.) Head of Math. Dept., The St. 
Lawrence Univ., Canton, N. Y., 44 E. Main St. 

Battin, Asst. Prof. IsaacL. A.M. (Swarthmore) Drew Univ., Madison, N.J., 14 Glenwild 
Rd. 

Beall, Geoffrey Ph.D. (London) Res. Asso., Inst. of Paper Chemistry, Appleton, Wis., 
1602 N. Meade St. 

Bechhofer, Robert E. A.B. (Columbia) Grad. Student, Columbia Univ., N. Y., N. Y., 
181 Degraw Ave., Teaneck, N. J. 

Becker, Harold W. Electrical Instr., Navy Yard, Mare Island, Calif., 1214 N. 34 St., 
Omaha 3, Nebr. 

Beckstead, Gordon L. M.S. (Michigan) Consultant, Dietary Labs., San Diego, Calif., 
and Grad. Student, Univ. of Calif., Berkeley ; 5718 Huntington, Richmond 

Beebe, Gilbert W. Ph.D. (Columbia) Lt. AUS Control Div., Office of the Surgeon 
General, 1818 H St., N.W., Wash., D. C. 

Been, Richard O. M.A. (George Washington) Agric. Economist, Bur. of Agric. Econ., 
Wash. 25, D. C., 6142 MacArthur Blvd., N.W., Wash. 16 

Behrends, Stanley G. LL.B. (LaSalle) Stat., Calif. Wire Cloth Corp., 1001 22nd Ave., 
Oakland, Calif., 439 65th St., Oakland 9 

Bellinson, Harold R. S.M. (Mass. Inst. Tech.) Indus. Eng. War Dept., Pentagon Blig., 
Arlington, Va., 3416 B St., S.E., Wash. 19, D.C. 

Belz, Asso. Prof. Maurice H. M.A. (Melbourne) Univ. of Melbourne, Carlton, N. 3, 
Victoria, Australia 
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Benford, Frank B.E.E. (Michigan) Physicist, 1643 Rugby Rd., Schenectady 8, N. Y. 

Bennett, Prof. Albert A. Ph.D. (Princeton) Brown Univ., Providence, R. I. 

Bennett, Blair M. M.A. (Columbia) Res. Asst., U. S. Dept. of Agric., Wash., D. C., 
1410 M St., N.W. 

Bennett, Carl A. A.M. (Michigan) Grad. Fellow, Dept. of Math., Univ. of Mich., Ann 
Arbor, Mich., P.O. Box 3, Saline 

Berger, Richard M.A. (Columbia) Res. Analyst, Dun and Bradstreet, 290 Broadway, 
N. Y., N. Y., 24 Rugby Rd., Rockville Centre 

Berkson, Joseph M.D., D.Sc. (Johns Hopkins) Chief, Div. of Biometry and Med. Stat., 
Mayo Clinic, Rochester, Minn., 437 14th Ave., S.E. 

Berman, Abraham J. M.A. (Brooklyn) Stat., N. Y. State Dept. of Labor, 80 Center 
St., N. Y., N. Y., 1460 College Ave., Bronx 

Berwick, Leonard A.B. (New York) Capt., AC, AAF School of Aviation Medicine, 
Randolph Field, Texas 

Bickerstaff, Asst. Prof. Thomas A. M.A. (Mississippi) Univ. of Mississippi, State 
College, Miss. 

Bigelow, Julian H. 531 W. 122nd St., N. Y. 27, N. Y. 

Bingham, Marion D. B.A. (George Washington) Acting Chief, Res. Div., Veterans 
Adm. Branch Office No. 9, 314 N. Broadway, St. Louis, Mo., 3908 Hiler St. 

Birnbaum, Asso. Prof. Z. William Ph.D. (Lwow) Univ. of Wash., Seattle 5, Wash., 
5782 31st Ave., N.E. 

Blackadar, Walter L. F.A.S., F.A.I.A., B.A. (McMaster) Asso. Actuary, Equitable 
Life Assurance Soc. of the U. 8., 393 Seventh Ave., N. Y.1, N. Y., 40 Hillcrest Rd., 
R.D. ®2, Plainfield, N. J. 

Blackburn, Prof. Raymond F. Ph.D. (Pittsburgh) Head, Dept. of Stat., Univ. of Pitts- 

- burgh, Pittsburgh 13, Pa. 818 Country Club Dr., Pittsburgh 16 

Blackwell, Asso. Prof. David H. Ph.D. (Illinois) Howard Univ., Wash., D. C., 3723 
Jay Si., N.E. 

Blake, Archie Ph.D. (Chicago) Sr. Stat., Office of the Army Surgeon General, Wash. 
25, D. C., 3200 19th St., N.W., Wash. 10 

Blanche, Ernest E. Ph.D. (Illinois) Prin. Adm. Analyst, War Dept. General Staff, 
43665 Pentagon Bldg., Wash. D. C. 9409 Montgomery Avenue, Chevy Chase 15, Md. 

*Bliss, Chester I. Ph.D. (Columbia) Biometrician, Conn. Agric. Experiment Station, 
Asso. Prof. of Biometry, Yale Univ., New Haven, Conn., 23 Edgehill Rd. 

Blommers, Asst. Prof. Paul J. Ph.D. (Iowa) Asst. Prof. and Registrar, State University 
of Iowa, Iowa City, Iowa, 3 Woolf Court 

Bloom, Rose B.A. (Hunter) 1916 Grand Concourse, Bronx 57, N.Y. 

Bloom, Royal F. M.A. (Minnesota) Asst. Head, Classification Res., Bur. of Naval Per- 
sonnel, Navy Dept., Wash. 25, D. C., 13 Hillside, Greenbelt, Md. 

Boddie, John B. Chief, Govt. Sec., International Economics, Dept. of Commerce, Wash- 
ington, D. C. 2628 Tunlaw Rd., N.W., Wash. 7 

Bonis, Austin J. B.S. (C.C.N.Y.) Major, War Dept. Gen. Staff, Wash., D. C., 2500 
Que St., N.W. 

Bonnar, Robert U. M.S. (Washington) Chemist, Shell Development Co., 4560 Horton 
Ave., Emeryville, Calif., 2959 Brook Way, San Pablo 

Boozer, Mary E. A.M. (Chicago) 905 Park Ave., Richmond 25, Va. 

Borland, James M.A. (Indiana) 65722 Central Ave., Indianapolis, Ind. 

Boschan, Paul Ph.D. (Vienna) Chief Analyst, The Econometric Institute, Inc., 500 
Fifth Ave., N. Y. 18, N. Y., 104 West 46 St., N. Y. 18 

tBowen, Earl K. A.M. (Boston) Instr. in Stat., Babson Inst. of Bus. Adm., Babson 
Park, Wellesley Hills, Mass., 246 Union St., Norwood 

Bower, Oliver K. Ph.D. (Illinois) Asso., Univ. of Ill., Urbana, IIll., 505 W. John, 
Champaign 
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{Bowker, Albert H. S.B. (Mass. Inst. Tech.) Student, Univ. of North Carolina, Chapel 
Hill, N. C., 704 Gimghoul Rd. 
Brady, Dorothy S. Ph.D. (California) Chief, Cost of Living Div., Bureau of Labor 
Stat., Wash. 25, D. C., 5173 Fulton St., N.W., Washington 16 
Brandt, Alva E. Ph.D. (Iowa State) Res. Specialist in Experimental Design and Anal- 
ysis, Soil Conservation Service, Dept. of Agric., Wash. 25 D. C., Route 3 Box 136, 
Vienna, Va. 
Brearty, Charles R. B.S. (California) Member of Technical Staff, Bell Telephone Labs., 
Inc., 463 West St., N. Y. 14, N. Y., 256-20 41st Ave., Little Neck, N. Y. 
Breden, Robert E. B.S. (Kansas State) Supv., Analysis and Records Div., The Proctor 
and Gamble Co., Sixth and Main Sts., Cincinnati 1, O.,6438 Mayflower Ave., Cincinnati 
12 
Breen, Nancy Brixey (Mrs. J.P.) A.B. (Vassar) 70 East 77 St., N. Y.21, N.Y. 
Bresnahan, Maurice F. Stat., U. S. Bur. of Labor Stat., Washington, D. C., Apt. 305, 
1015 N St., N.W. Wash. 1 
Bridger, Clyde A. M.S. (Oregon State) Instr. in Math., Univ. of Utah, Salt Lake City, 
Utah (on leave), Instr. of Stat., State College Station, Raleigh, N. C. 
Brier, Glenn A.M. (George Washington) Meteorologist, U. S. Weather Bur., Wash. 
25, D.C. 
Brixey, Asso. Prof. John C. Ph.D. (Chicago) Univ. of Okla., Norman, Okla. 927 S. Pick- 
ard St., Norman 
Bronfenbrenner, Martin Ph.D. (Chicago) Lt. (j.g.) USNR, Office of CINPAC, c/o 
Postmaster, San Francisco, Calif. 728 N. First Ave., Tucson, Ariz. 
Brookner, Ralph J. Ph.D. (Columbia) 1390 Broadway, Beaumont, Texas 
Brooks, Alvin G. B.A. (Ripon) Chief of Inspection Tasks Sec., Western Elec. Co., 
Hawthorne Sta., Chicago, Ill. 4538 Lawn Avenue, Western Springs 
Brown, Asst. Prof. Arthur B. Ph.D. (Harvard) Queens Coll., Flushing, N. Y. 156-01 
90th Ave., Apt. 4F, Jamaica 2 
Brown, Arthur W. A.B. (Princeton) Standard Oil Co., Rm. 3101, 30 Rockefeller Plaza, 
N. Y., N. Y. 25 Wilmer St., Madison, N. J. 
Brown, Res. Asso. Prof. George W. Ph.D. (Princeton) Stat. Lab., Iowa State Coll. 
Ames, Iowa 
tBrown, Richard H. A.B. (Columbia) Lecturer in Math., Columbia Univ., 53¥ W. 116th 
St., N. Y. 27, N. Y., 1310 John Jay Hall, Columbia Univ., N. Y. 27, N.Y. 
Brown, Prof. Theedore H. Ph.D. (Yale) Dept. of Bus. Stat., Harvard Univ., Grad. 
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